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In this article, we review some of the complexities of jet algorithms and of the resultant compar- 
isons of data to theory. We review the extensive experience with jet measurements at the Tevatron, 
the extrapolation of this acquired wisdom to the LHC and the differences between the Tevatron 
and LHC environments. We also describe a framework (SpartyJet) for the convenient comparison 
of results using different jet algorithms. 
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1 Introduction 

Most of the interesting physics signatures at the Tevatron and LHC involve final states with jets of 
hadrons. A jet is reconstructed from energy depositions in calorimeter cells and/or from charged 
particle track momenta, and ideally is corrected for detector response and resolution effects so that 
the resultant 4-vector corresponds to that of the sum of the original hadrons comprising the jet. The 
jets can also be further corrected, for hadronization effects, back to the parton(s) from which the jet 
originated. The resultant measurements can be compared to predictions from parton shower Monte 
Carlos. If further corrections are made to account for the showers, or if these corrections are assumed 
to be small, comparisons can be made directly to the short distance partons described by fixed order 
perturbative calculations. 

In order to actually reconstruct a jet, and make comparisons between data and theoretical predic- 
tions, a precise definition of the jet is required. The definition is presented in the form of a jet algorithm. 
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Jet algorithms cluster partons, or particles or calorimeter towers based on proximity in coordinate space 
(as for example in cone algorithms) or proximity in momentum space (as for example in kx algorithms). 
For a precise comparison of experiment to theory, it is advantageous for a jet algorithm to provide a 
similar description of a hard scatter event regardless if it is applied at the detector, hadron or parton 
level. 

In this article, we will review some of the complexities of jet algorithms and of the resultant compar- 
isons of data to theory. We will review the extensive experience with jet measurements at the Tevatron, 
the extrapolation of this acquired wisdom to the LHC and the differences between the Tevatron and 
LHC environments. We will also discuss ways in which the jet algorithm systematics can be reduced 
to the percent level, an important goal for LHC analyses. Finally we will describe a framework (Spar- 
tyJet) for the convenient comparison of results using different jetalgorithms. Several of the authors 
are members of CDF and ATLAS and we apologize in advance for our concentration on those experi- 
ments. Given the restrictions of space, we will not try a comprehensive review of physics with jets at 
hadron-hadron colliders, but instead refer the reader to a recent review [1] . 



proton 



underlying 




Figure 1: Pictorial representation of a hard scattering event [2]. 



2 Factorization 

The fundamental challenge when trying to make theoretical predictions or interpret experimentally 
observed final states is that the theory of the strong interactions (QCD) is most easily applied to the 
short distance (<C 1 fermi) degrees of freedom, i.e., to the color-charged quarks and gluons, while the 
long distance degrees of freedom seen in the detectors are color singlet bound states of these degrees 
of freedom. We picture the overall scattering process, as pictorially displayed in Fig. [T] [2] , evolving 
from the incoming long distance hadrons in the beams to the short- distance scattering process to the 
long distance outgoing states, as occurring in several (approximately) distinct steps. The separation, 
or factorization, of these steps is essential both conceptually and calculationally. It is based on the 
distinct distance (or momentum) scales inherent at each step. 

We imagine as a first step picking out from the incident beam particles the short distance partons 
(defined by an appropriate factorization scale) that participate in the short distance scattering. The 
relative probability to find the scattering partons at this step is provided by the parton distribution 
functions (pdfs), which are functions of the partons' color and flavor, the longitudinal momentum frac- 
tions Xk carried by the partons and the factorization scale /i, all of which serve to uniquely define the 
desired partons. The pdfs are themselves determined from global fits [3] to a wide variety of data. 
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EVENT 



HADRON-HADRON COLLISION 


i Primarv (Hard) Parton-Parton Scatterina 




Fraamentation 


\ \ 1 nitial -State Radiation (IS R) = Spacelike Showers 
i associated with Hard Scattering 




Perturbative: 

Final-State Radiation 
(FSRi 

= Timelik8 Shoivgrs 
= Jet Broadenina and 


Non-perturbative: 

String / Cluster 

Hadronisation 

(Colour RGConnections''] 


Underlvina Event 




1 i i MultiDle Parlon-Parton Interactions: Additional 
1 parton-parton collisions (in principle with 
j showers etc) in the same hadron-hadron 




Hard Final-State 
Bremsstrahluna 




1 i collision. 








1 = Multiple Perturbative Interactions (M PI) 








i = Spectator Interactions 








j Beam Remnants: Lett over hadron remnants irom the incominq beams. 
1 Coloured and hence correllated with the rest ot the event 





PILE-UP: Additional hadron-hadron collisions recorded as part ot the same 
event. 



Figure 2: A dictionary of hadron collider terms relating to jet measurements [6]. 

all of which can be analyzed in the context of perturbative QCD (pQCD) essentially as outlined here. 
The partons selected in this way can emit radiation prior to the short distance scattering yielding the 
possibility of initial state radiation (ISR). The remnants of the original hadrons, with one parton re- 
moved, are no longer color singlet states and will interact, presumably softly, with each other generating 
(approximately incoherently from the hard scattering) an underlying distribution of soft partons, the 
beginning of the underlying event (UE). 

Next comes the short distance, large momentum transfer scattering process that may change the 
character of the scattering partons and/or produce more partons (or other interesting particles). The 
cross section for this step is evaluated at fixed order in pQCD, presumably to next-to-leading-order 
(NLO), or higher. Then comes another color radiation step, when many new gluons and quark pairs 
are added to the state (final state radiation or FSR), dominated by partons that have low energy and/or 
are nearly collinear with the scattered short distance partons. The FSR, like the ISR, is described by 
the showers in the Monte Carlo programs [H [5] and calculated probabilistically in terms of summed 
leading logarithm perturbation theory. 

The final step in the evolution to long distance states involves a non-perturbative hadronization pro- 
cess that organizes the colored degrees of freedom from the showering and from the softer interactions 
of other initial state partons (the UE is simulated in terms of beam-beam remnants and multiple par- 
ton interactions) into color-singlet hadrons with physical masses. The non-perturbative hadronization 
process yields a collection of ground state hadrons (primarily pions) and resonances {Ai, A2,etc), and 
the resonances then decay into lighter hadrons such as pions. The masses of the resonances result in the 
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decay pions being produced with non-zero transverse momenta with respect to the momentum direction 
of the original resonance (but still small compared to typical jet momenta). The details are determined 
from data but the decay modes for some of the higher mass resonances are not well-understood leading 
to uncertainties (and differences between Monte Carlo programs) in the details of the production of the 
final state hadrons. A convenient dictionary of the terms described above is summarized in Fig. [2] [6] . 
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Figure 3: A representation of the stages of jet production and measurement. 

The hadronization step is accomplished in a model dependent fashion [i.e., a different fashion) in 
different Monte Carlos. 

The union of the showering and the hadronization steps is what has historically been labelled as 
fragmentation, as in fragmentation functions describing the longitudinal distribution of hadrons within 
final state jets. In practice, both the radiation and hadronization steps tend to smear out the energy 
that was originally localized in the individual short distance partons (a "splash-out" effect), while 
the contributions from the underlying event (and any "pile-up" from multiple hadron collisions in the 
same data-taking time interval) add to the energy originally present in the short distance scattering (a 
"splash-in" effect). Finally the hadrons, and their decay products, are detected with finite precision in 
a detector. This final step of the jet components interacting in the detector is represented in Fig. [31 

The separation of this complicated scattering process into distinct steps is not strictly valid in 
quantum mechanics where interference plays a role; we must sum the amplitudes before squaring them 
and not just sum the squares. However, some features of this factorization can be rigorously established 
[7], and the numerical dominance of collinear QCD radiation ensures that the simple picture presented 
here and quantified by Monte Carlo generated events, with interference only approximately represented 
in the structure of the showers, e.g., the angles of emission of partons in the showers are monotonically 
ordered, provides a reliable first approximation. 

In order to interpret the experimentally detected long distance objects, the charged particles and 
energy depositions in calorimeter cells, in terms of the underlying short distance physics, the partons, 
jet algorithms are employed to associate "nearby" experimental objects into jets. The underlying 
assumption is that the kinematics (energy and momentum) of the resulting cluster or jet provides a 
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useful measure of the kinematics (energy and momentum) of the underlying, short- distance partons. 
The goal is to characterize the short-distance physics, event-by-event, in terms of the discrete jets found 
by the algorithm. In particular, we assume that the basic mismatch between colored short-distance 
theory objects and the colorless long-distance experimental objects does not present an important 
numerical limitation, compared to our goal of percent level accuracy for jet algorithm effects. We 
assume that we are either insensitive to or can reliably correct for the effects of the UE, ISR, FSR and 
hadronization. We assume that most relevant features of a jet can be described by the (up to) 2 partons 
per jet present in a NLO calculation. We will investigate the features that are not so well- described. 

As noted, jet algorithms rely on the merging of objects that are, by some measure, nearby each 
other. This feature is essential in perturbation theory, where the divergent contributions from virtual 
diagrams must contribute in exactly the same way, i.e., contribute to the same kinematic bins, as 
the divergent contributions from soft and collinear real emissions, in order that these contributions 
can cancel. It is only through this cancellation that jet algorithms serve to define an IR-safe (finite) 
quantity, i.e., a quantity that is insensitive to the emission of extra soft and/or collinear partons. The 
standard measures of "nearness" (see [S]) include pair- wise relative transverse momenta, as in the 
algorithm, or angles relative to a jet axis, as in the cone algorithm. By definition a "good" algorithm 
yields stable {i.e., very similar) results whether it is applied to a state with just a few partons (as in 
NLO perturbation theory), a state with many partons (after the parton shower as simulated in a Monte 
Carlo), a state with hadrons (as simulated in a Monte Carlo including a model for the hadronization step 
and the underlying event), or applied to the observed tracks and energy deposition in a real detector. 
As we will see, this requirement constitutes a substantial challenge. Further, it is highly desirable 
that the identification of jets be insensitive to the contributions from the simultaneous uncorrelated 
soft collisions that occur during pile-up at high luminosity. Finally, we want to be able to apply the 
same algorithm (in detail) at each level in the evolution of the hadronic final state. This implies that 
we must avoid components in the algorithm that make sense when applied to data but not to fixed 
order perturbation theory, or vice versa. This constraint will play an important role in our subsequent 
discussion. 

In practice, we can think of the jet algorithm as a set of mathematical rules that detail how to 
carry out two distinct steps. The first step operates, event-by-event, on the list of 4- vectors, which 
describes either the perturbative final state, the final-state hadrons in the MC simulated event or the 
output from the detector, to turn the original list into a set of sublists, one sublist for each jet (plus the 
beam jets). The second step specified by the algorithm tells us how to construct appropriate kinematic 
quantities from each sublist in order to describe the kinematic properties of the individual jets. Both 
steps depend on the specific jet algorithm, as will be illustrated in detail in the next sections for specific 
jet algorithms, and have varied over time. In particular, early applications of jets did not employ true 
4-vector arithmetic and largely ignored the information carried by the invariant mass of the jet. In 
Run II at the Tevatron and at the LHC true 4-vector arithmetic is and will be employed (the so-called 
"E-scheme" as recommended for Run II in [S]). The corresponding kinematic variables describing the 
jets include the (true) transverse momentum, pt, the rapidity, y = llog-f^^, the azimuthal angle, 0, 
and the invariant mass of the jet, Mj. We will have more to say about the history of different choices 
of kinematic variables in the next section. 

For many events, the jet structure is clear and the jets, into which the individual calorimeter towers 
should be assigned, are fairly unambiguous. However, in other events, such as the lego plot of a CDF 
event shown in Fig. HI the complexity of the energy depositions means that different algorithms will 
result in different assignments of towers to the various jets. This is not a problem if a similar complexity 
is exhibited by the theoretical calculation which is to be compared to the data. However, the most 
precise and thoroughly understood theoretical calculations arise in fixed order perturbation theory, 
which can exhibit only limited complexity, e.g., at most 2 partons per jet at NLO. On the other hand, 
for events simulated with parton shower Monte Carlos the complexity of the final state is more realistic, 
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but the intrinsic theoretical uncertainty is larger. Correspondingly the jets identified by the algorithms 
vary if we compare at the perturbative, shower, hadron and detector levels. Thus it is essential to 
understand these limitations of jet algorithms and, as much as possible, eliminate or correct for them 
to approach our percent level goal. It is the aim of the following review to highlight the issues that 
arose during Runs I and II at the Tevatron, discuss their current understanding, and outline possible 
preventative measures for the LHC [6]. 



Figure 4: Impact of different jet clustering algorithms on an interesting CDF event taken in Run II. 
The segmentation in 77 and shown the lego plot corresponds to the calorimeter segmentation. Energy 
depositions in the electromagnetic portion of the calorimeter are colored red and those in the hadronic 
section are colored blue. The numbers in the figure are transverse momenta of the jets pointed to by 
the arrows, and different colors represent jets clustered by different algorithms. 



3 Jets: Parton Level vs Experiment 

3.1 Iterative Cone Algorithm 
3.1.1 Definitions 

To illustrate the behavior of jet algorithms consider first the original Snowmass implementation of 
the iterative cone algorithm [9]. The first step in the algorithm, i.e., the identification of the sublists 
of objects corresponding to the jets, is defined in terms of a simple sum over all (short distance or 
long distance) objects within a cone centered at rapidity (the original version used the pseudorapidity 
rj = \ log(cot |)) and azimuthal angle {yc, 0c)- Using the objects in the cone we can define ap^-weighted 




centroid via 




Vc = 



Efccc Vk ■ PT,k 



'c = 



J2lccPT,l 
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If the pT-weighted centroid does not coincide with the geometric center of the cone, (jjq, ^c-) 7^ {yc, (f>c), 
a cone is placed at the p^-weighted centroid and the calculation is repeated. This simple calculation 
is iterated until a "stable" cone is found, {ycy4>c) ~ iycy4>c)y which serves to define the jet (and the 
name of this algorithm as the iterative cone algorithm). Thus, at least in principle, one can think in 
terms of placing trial cones everywhere in {y, 0) and allowing them to "flow" until a stable cone or jet 
is found. This flow idea is illustrated in Fig. [5l where a) illustrates the LEGO plot for a simple (quiet) 
Monte Carlo generated event with 3 apparent jets and b) shows the corresponding flows of the trial 
cones towards the (obvious) final jets. Compared to the event in Fig. H] there is little ambiguity in this 
event concerning the jet structure. 




Figure 5: (An ideal) Monte Carlo generated event with 2 large energy jets and 1 small energy jet in the 
LEGO plot a), and the corresponding flow structure of the trial cones in b). 

To facilitate the subsequent discussion and provide some mathematical structure for this image of 
"flow" we can define the "Snowmass potential" in terms of the 2-dimensional vector "r* = (?/, 0) via 

V (T^) = -I 5^PT,. {nine - i^k - ^)') e (/?Le ' k ' T^f) , 

where the function defines the objects inside the cone centered at r. The flow described by the 
iteration process is driven by the "force" 

F {V) = -vv iV) = J2pT,k {r, - r) e (i?Le - i^k - rf) 

k 

= (^C( T^) Yl PT,k. 

fcCC(r) 



where r ^^-^^ = {^y^,^-,y(f)^^-,^j and k C C {V) is defined by y (y^ - y)^ + (0^ - 0)^ < Rcone- As 

desired, this "force" pushes the cone to the stable cone position, i.e., the minimum of the Snowmass 
potential. As noted above the current Run II analyses and those expected at the LHC, described 
in more detail below, 4-vector techniques are used and the corresponding E-scheme centroid is given 
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instead by 



A; C C iff V {yu - ycf + {4>k - <Pcf < Rc 



PC = {Ec, ~pc) = ^^k^ Pk) , Vc 

kcC 



1 Ec+ Pz,c 
- Ill 

2 Ec-p,,c 



tan 



-1 Py,c 
Px,c' 



In the NLO perturbative calculation these changes in definitions result in only tiny numerical changes, 
compared to the Snowmass definition numbers. 




°-9d.O 0.4 0.8 1.2 1.6 




Figure 6: Perturbative 2-parton phase space: z = Pt,2/pt,i {pt,i > Pt,2), d = y ivi — + (0i — <p2f 
for a) the naive Rsep = 2 case and b) for Rsep = 1-3 case suggested by data. 

As an introduction to how the iterative cone algorithm works, consider first its application to NLO 
level in perturbation theory (see, e.g., [10]), where there are at most 2 partons in a cone. As defined 
above, the cone algorithm specifies that two partons are included in the same jet (z.e., form a stable cone) 
if they are both within Rcom [e.g., 0.7 in (y, 0) space) of the centroid, which they themselves define. 
This means that 2 partons of equal pt can form a single jet as long as their pair-wise angular separation 
does not exceed the boundaries of the cone, AR = 2Rcone- On the other hand, as long as AR > Rcom, 
there will also be stable cones centered around each of the partons. The corresponding 2-parton phase 
space for Rcone = 0.7 is illustrated in Fig. [6] a) in terms of the ratio z = Pt,2/pt,i {pt,i > Pt,2) and the 

angular separation variable d = \J {yi — 1/2)^ + {4>i — 02)^- To the left of the line d = Rcone the two 
partons always form a single stable cone and jet, while to the far right, d > 2Rcone, there are always two 
distinct stable cones and jets, with a single parton in each jet. More interesting is the central region, 
Rcone < d < 2Rcone, which exhibits both the case of two stable cones (one of the partons in each cone) 
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Figure 7: 2-parton distibution in [d, z) in a) with d = 1.0, z = 0.6 and the corresponding energy-in-cone, 
Ec (r), and potential, V (r). 



and the case of three stable cones (the previous two cones plus a third cone that includes both partons). 
The precise outcome depends on the specific values of z and d. (Note that the exactly straight diagonal 
boundary in the figure corresponds to the pr-weighted definition of the Snowmass algorithm, but is only 
slightly distorted, < 2%, when full 4-vector kinematics is used in the Run II algorithms.) To see the 
three stable cone structure in terms of the 2-parton "Snowmass potential" consider the point z = 0.6 
and d = 1.0, which is in the 3 cones 1 jet region. This configuration, in terms of the 2-parton 
coordinates z and d, is illustrated in Fig. [7^. The corresponding energy in a cone, Ec, normalized to 
the energy of the more energetic parton, is illustrated by the (red) dashed line in Fig. [Tb, where the 
location of the cone center, r, is constrained to lie in the plane of the 2 partons with the origin at 
the location of the more energetic parton. The solid (green) curve in Fig. [7)d shows the corresponding 
"Snowmass potential", again normalized to the energy of the more energetic parton, versus the same 
1-D location radius r. This potential exhibits the expected 3 minima corresponding to a stable cone 
at each parton (r = and r = d = 1.0) and a third stable cone, central between the other two, that 
includes the contributions from both partons. A relevant point is that the central minimum is not 
nearly as deep (i.e., as robust) as the other two. As we shall see, this minimum often does not survive 
the smearing inherent in the transition from the short distances of fixed order perturbation theory to 
the long distances of the physical final state. As indicated by the labeling in Fig. El in the 3 stable 
cone region the original perturbative calculation [TU] kept as the jet the 2-in-l stable cone, maximum 
Pt configuration, i.e., the cone that included all of the energy in the other two cones consistent with 
the merging discussion below. 

As we will see, much of the concern and confusion about the cone algorithm arises from the treatment 
of this 3 stable cone region. It is intuitively obvious that, as the energy in the short-distance partons 
is smeared out by subsequent showering and hadronization, the detailed structure in this region is likely 
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to change. In particular, while two nearly equal pt partons separated by nearly 2Rcone niay define a 
stable cone in fixed order pQCD, this configuration is unlikely to yield a stable cone after showering 
and hadronization. 

Having performed the first step in the algorithm to identify the particles in the cone C, the second 
step is to define the kinematic variables describing the jet. As suggested above the the Snowmass 
definition of the iterative cone algorithm used angular variables defined by the pt weighted expressions 



yj 



'Ekcc Vk ■ PT,k 



and, instead of the true transverse momentum, the scalar transverse momentum 



Pj = Pt,s 



nowmass ^ ^ PT,1 
ICC 



In the (recommended) case of 4-vector arithmetic we have instead (as suggested as above) 



Pc = {Ec,~P 



c) 



Pc,t\ 



kcc 



^\ _ 1 1 + Pz,C , 

Pk) , yj = - \n — , 0j 

2 Ec- Pz,c 



-1 Pv,c 

Px,C 



kcc 



,m] = pI = eI-^1. 



During Run I CDF used an intermediate set of definitions with I3-scheme (4-vector) angles and the jet 
"transverse momentum" given by the transverse energy 

Pj = Et = Ec sin 9c- 



In the next section we will briefiy explore the quantitative differences between these definitions in the 
context of NLO perturbation theory. 



3.1.2 Rsep: Seeds and IR-Sensitivity 

Iterative cone algorithms similar to the ones described in the previous section were employed by both the 
CDF and D0 collaborations during Run I with considerable success. There was fairly good agreement 
with NLO pQCD for the inclusive jet cross section over a dynamic range of order 10®. During Run 
I, the data were corrected primarily for detector effects and for the contributions of the underlying 
event. In fact, a positive feature of the cone algorithm is that, since the cone's geometry in [y, (p) space 
is (meant to be) simple, the correction for the "splash-in" contribution of the (largely uncorrelated) 
underlying event (and pile-up) is straightforward. (As we will see below, the corrections being used 
in Run II are more sophisticated.) The uncertainties in both the data and the theory were 10% or 
greater, depending on the kinematic regime, and helped to ensure agreement. However, as cone jets 
were studied in more detail, various troubling issues arose. For example, it was noted long ago [TT] 
that, when using the experimental cone algorithms implemented at the Tevatron, two jets of comparable 
energy (taken from 2 different events in the data) are not merged into a single jet if they are separated 
by an angular distance greater than approximately 1.3 times the cone radius, while the simple picture 
of Fig. [H]a) suggests that merging should occur out to an angular separation of 2_Rcone- Independently 
it was also noted that the dependence of the experimental inclusive jet cross section on the cone radius 
Rcone [12] and the shape of the energy distribution within a jet [13] both differed discernibly from the 
NLO predictions (the data were less Rcone dependent and exhibited less energy near the edge of the 
cone). All three of these issues seemed to be associated with the contribution from the perturbative 
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configuration of two partons witli comparable Pt at opposite sides of the cone (2; ^ 1, ~ 2Rcone = 1.4 
in Fig. [H]a)) and the data suggested a lower contribution from this configuration than present in the 
perturbative result. To simulate this feature in the perturbative analysis, a phenomenological parameter 
Rsep was added to the NLO implementation of the cone algorithm [H]. In this "experiment-aware" 
version of the perturbative cone algorithm, two partons are not merged into a single jet if they are 
separated by more than Rsep ■ Rcone from each other, independent of their individual distance from the 
PT-weighted jet centroid. Thus, the two partons are merged into the same jet if they are within Rcone 
of the pT-weighted jet centroid and within Rsep ■ Rcone of each other; otherwise the two partons are 
identified as separate jets. In order to describe the observed Rcone dependence of the cross section and 
the observed energy profile of jets, the specific value Rsep = 1.3 was chosen (along with a "smallish" 
renormalization/factorization scale /i = pt/4:), which was subsequently noted to be in good agreement 
with the aforementioned (independent) jet separation study. The resulting 2 parton phase space is 
indicated in Fig. Elb). In the perturbative calculation, this redefinition, leading to a slightly lower 
average pt for the leading jet, lowers the NLO jet cross section by about 5% (for R = 0.7 and pt = 100 
GeV/c). It is important to recognize that the fractional contribution to the inclusive jet cross section 
of the merged 2 parton configurations in the entire wedge to the right of d = Rcone is only of order 
10% for jet Pt of order 100 GeV/c, and, being proportional to (pr), decreases with increasing pt- 
Thus it is no surprise that, although this region was apparently treated differently (in the NLO theory 
comparisons) by the cone algorithm implementations of CDF and D0 during Run I as discussed below, 
there were no relevant cross section disagreements above the > 10% uncertainties. Further, as we will 
discuss below, it is the variation in the treatment of this (effectively 10%) region of the 2-parton phase 
space that drives many of the differences between jet algorithms. 

While the parameter Rgep is ad hoc and thus an undesirable complication in the perturbative jet 
analysis, it will serve as a useful pedagogical tool in the following discussions. To illustrate this point 
quantitatively. Fig. [H] shows the dependence on Rsep for various choices of the jet momentum Pj at 
NLO in perturbation theory. The curves labeled Snowmass use the pt weighted kinematics described 
above with Pj given by the scalar sum of the transverse momenta of the partons in the cone. The 
two E-scheme algorithms use full 4-vector kinematics and Pj equal to either the magnitude of the 
true (vector sum) transverse momentum (the recommended choice), or the "transverse energy" defined 
by Pj = Ej- = EsinO (as defined by CDF in Run I). Thus this last variable knows about both the 
momentum and the invariant mass of the set of partons in the cone, which can be sizable for well 
separated parton pairs. The differences in the various ratios for different values of Rsep tell us about 
how the 2-parton configurations contribute. For example. Fig. [8] a) tells us that, since, for a given 
configuration of 2 partons in a cone, > PT,Snowmass > Pt, the cross sections at a given value of 
Pj will show the same ordering. Further, as expected, the differences are reduced if we keep only 
configurations with small angular separation, Rgep = 1. From Fig. [8]b) we confirm the earlier statement 
that lowering Rsep from 2 to 1.3 yields a 5% change for the Snowmass algorithm cross section with 
Pj = 100 GeV, while lowering it all the way to Rsep = 1, i-e., removing all of the triangular region, 
lowers the 100 GeV Snowmass jet cross section by approximately 12%. Figs. [8] c) and d) confirm 
that 4-vector kinematics with Pj = px exhibits the smallest sensitivity to Rsep, ^.e., to the 2-parton 
configurations in the triangle. The choice Pj = E^, with its dependence on the mass of the pair, 
exhibits the largest sensitivity to Rsep- These are all good reasons to use the recommended E-scheme 
kinematics with Pj = pt- The move to employ 4-vector kinematics for jet analyses in Run II and at 
the LHC is a positive step. It will allow the meaningful investigation of jet masses, which will likely be 
very useful at the LHC. The decay of large mass (few TeV) new particles will lead to highly boosted 
W's, Z's and top quarks that will be observed as single jets. Thus the mass of such jets may constitute 
a useful selection tool. 

The difference between the perturbative implementation of the iterative cone algorithm and the 
experimental implementation at the Tevatron, which is simulated by Rsep, is thought to arise from 
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Figure 8: Ratios of the NLO inclusive cone jet cross section versus the jet momentum for 3 definitions 
of the kinematics for various values of i?. 
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several sources. While the perturbative version (with R, 



sep = 2) analytically includes all 2-parton 
configurations that satisfy the algorithm (recall Fig. [6] a)), the experiments employ the concept of seeds 
to reduce the analysis time and place trial cones only in regions of the detector where there are seeds, 
i.e., pre-clusters with substantial energy. This procedure introduces another parameter, the lower 
Pt threshold defining the seeds, and also decreases the likelihood of finding the 2- showers-in-one- jet 
configurations corresponding to the upper right-hand corner of the 3 cones — >■ 1 jet region of Fig. [6] a) 
and the middle minimum in Fig. [7]b). Thus, the use of seeds contributes to the need for Rgep < 2. 
Perhaps more importantly, the desire to match the theoretical algorithm with the experimental one 
means that we should include seeds in the perturbative algorithm. This is accomplished by placing 
trial cones only at the locations of each parton and testing to see if any other partons are inside of these 
cones. Thus at NLO, 2 partons will be merged into a single jet only if they are closer than Rcone in 
[y, (j)) space. This corresponds to Rsep = 1-0 in the language of Fig. [6] and produces a larger numerical 
change in the analysis than observed, i.e., we wanted Rg^p — 1-3. More importantly at the next order 
in perturbation theory, NNLO, there are extra partons that can play the role of low energy seeds. The 
corresponding parton configurations are illustrated in Fig. |9l At NLO, or in the virtual correction in 
NNLO, the absence of any extra partons to serve as a seed leads to two distinct cones as on the left, 
while a (soft) real emission at NNLO can lead to the configuration on the right where the soft gluon 
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"seeds" the middle cone that includes all of the partons. The resulting separation between the NNLO 
virtual contribution and the NNLO soft real emission contribution [i.e., they contribute to different 
jet configurations) leads to an undesirable logarithmic dependence on the seed pt threshold [TSj. In 
the limit of an arbitrarily soft seed pr cutoff, the cone algorithm with seeds is no longer IR-safe. By 
introducing seeds in the algorithm, we have introduced exactly what we want to avoid, sensitivity to 
soft emissions. From the theory perspective, seeds are an undesirable component in the algorithm and 
should be eliminated. 




Figure 9: Two partons in two cones or in one cone with a (soft) seed present [8]. 



3.1.3 Seedless and Midpoint Algorithms 

The labeling of the Run I cone algorithm with seeds as Infrared Unsafe has led some theorists to suggest 
that the corresponding analyses should be disregarded. This is too extreme a reaction, especially since 
the numerical difference between the jet cross section found in data using seeds is expected to be less 
than a few percent different from an analysis using a seedless algorithm. A more useful approach 
will be to either avoid the use of seed^, or to correct for them in the analysis of the data, which 
can then be compared to a perturbative analysis without seeds. Note that it may seem surprising 
that an algorithm, which is Infrared Unsafe due to the use of seeds, leads to experimental results that 
differ from a Infrared Safe seedless algorithm by only a few percent. The essential point is that the 
lack of IR-safety is a property of the fixed-order perturbative application of the algorithm with seeds, 
not of the experimental application. In real data the additional soft components of the event (initial 
state radiation, final state radiation and the underlying event) ensure that there are seeds "nearly" 
everywhere. Thus there is only a small change from the situation where seeds are assumed to be 
everywhere (the seedless algorithm). In stark contrast the NLO perturbative application of an algorithm 
with seeds has only the energetic partons themselves to act as seeds. Thus there is a dramatic change 
at NNLO where the extra parton can serve as a seed, as in Fig. [9], changing the found jet structure 
of the event even when the extra parton is quite low energy. This is the source of the perturbative 
Infrared sensitivity. 

One of the main problems with the use of a seedless cone algorithm has been its slow speed with 
respect to the seeded cone algorithms. This has made its use in reconstruction of a large number of 
events difficult. Combined with the fact that, for inclusive distributions, the differences between the 
results from a seeded cone algorithm like Midpoint (defined below) and a seedless algorithm tend to be 
on the order of a percent or lesfl there was no strong motivation for its use. Recently, a new seedless 
algorithm (SISCone) [16| that has speeds comparable to the seeded cone algorithms has been developed, 
removing this difficulty. For this reason, the SISCone algorithm is being adopted by the experiments 

^The Run II recommendations [5] did include the suggestion of a seedless algorithm. 

^In Ref. [IS], a statement is made that the impact may be larger for some exclusive final state observables. 
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at both the Tevatron and LHClfl. Note that the problems with dark towers and the smearing of stable 
solution points (discussed later in Section I3.4.1I) still remain with a seedless algorithm. 

To address the issue of seeds on the experimental side and the Rsep parameter on the phenomeno- 
logical side, the Run II study [8] recommended using the Midpoint cone algorithm, in which, having 
identified 2 nearby jets, one always checks for a stable cone with its center at the midpoint between 
the 2 found cones. Thus, in the imagery of Fig. [9|, the central stable cone is now always looked for, 
whether there is an actual seed there or not. It was hoped that this would remove the sensitivity to 
the use of seeds and remove the need for the Rgep parameter. While this expectation is fully justified 
with the localized, short distance configuration indicated in Fig. [HI more recent studies suggest that at 
least part of the difficulty with the missing stable cones at the midpoint position is due to the (real) 
smearing effects on the energy distribution in {y, 0) of showering and hadronization. Also it is impor- 
tant to note that, in principle, IR-safety issues due to seeds will reappear in perturbation theory at 
order NNNLO, where the midpoint is not the only problem configuration (for example, a seed at the 
center of a triangular array of 3 hard and merge-able partons can lead to IR- sensitivity). Eliminating 
the use of seeds remains the most attractive option. 

3.1.4 Merging 

Before proceeding we must consider another important issue that arises when comparing the cone 
algorithm applied in perturbation theory with its application to data at the Tevatron. The practical 
definition of jets equaling stable cones does not eliminate the possibility that the stable cones can 
overlap, i.e., share a subset (or even all) of their calorimeter towers. To address this ambiguity, 
experimental decisions had to be made as to when to completely merge the two cones (based on the 
level of overlap), or, if not merging, how to split the shared energy. Note that there is only a weak 
analogy to this overlap issue in the NLO calculation. As described in Fig. E]a), there is no overlap 
in either the left-hand (1 cone — *• 1 jet) or right-hand (2 cones — *• 2 jets) regions, while in the middle 
(3 cones —>■ 1 jet) region the overlap between the 3 cones is 100% and the cones are always merged. 
Arguably the phenomenological parameter Rgep also serves to approximately simulate not only seeds 
but also the role of non-complete merging in the experimental analysis. In practice in Run I, CDF 
and D0 chose to use slightly different merging parameters. Thus, largely unknown to most of the 
theory community, the two experiments used somewhat different cone jet algorithms in Run I. The 
CDF collaboration cone algorithm, JETCLU [11], also employed another "feature" called ratcheting, 
that was likewise under-documented. Ratcheting ensured that any calorimeter tower in an initial 
seed was always retained in the corresponding final jet. Stability of the final cones was partially 
driven by this "no-tower-left-behind" feature. Presumably the two experiments reported compatible 
jet physics results in Run I due to the substantial (> 10%) uncertainties. Note also that, after the 
splitting/merging step, the resulting cone jets will not always have a simple, symmetric shape in [y, 0), 
which complicates the task of correcting for the underlying event and leads to larger uncertainties. In 
any case the plan for Run II as outlined in the Run II studies |8], called for cone jet algorithms in the 
2 collaborations as similar as possible. Unfortunately, during Run II the collaborations have evolved to 
employing somewhat different cone algorithms. On the merging question, CDF in Run II merges two 
overlapping cones when more than 75% of the smaller cone's energy overlaps with the larger jet. When 
the overlap is less, the overlap towers are assigned to the nearest jet. D0, on the other hand, uses 
a criterion of a 50% overlap in order to merge. There is anecdotal evidence from several studies that 
a merging criterion of 75% may be "safer" in high-density final states [T7], where there is a tendency 
for over-merging to occur. While it is not necessary that all analyses use the same jet algorithm, for 
purposes of comparison it would be very useful for the experiments to have one truly common algorithm. 

•^A streamlined (faster) version of the seedless algorithm was used during the early stages of CDF in Run II, but was 
dropped because of the near equivalence of the results obtained with the Midpoint cone algorithm. 
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There is certainly a lesson to be learned here for the collaborations at the LHC. 
3.1.5 Summary 

In summary, the iterative cone algorithm benefits, in principle, from a simple geometric definition that 
allows simple corrections for the UE contributions. At the same time it suffers from the experimental 
use of seeds and the need to implement a split/merge stage. The inclusion of the Rsep parameter in the 
perturbative calculation, while allowing more detailed comparisons to data, has also served to confuse 
the situation. Before discussing the role of fragmentation and smearing in the cone algorithm, we turn 
now to the other primary jet algorithm, the kx algorithm. 

3.2 Algorithm 

With the mixed history of success for the cone algorithm, the (as yet) less well studied k^ algorithm [TSl 
[TU] offers the possibility of more nearly identical analyses in both experiments and in perturbation theory. 
This algorithm, which was first used at electron-positron colliders, is based on a pair-wise recombination 
scheme intended, in some sense, to "undo" the splitting that occurs during the fragmentation stage. 
Two partons/particles/calorimeter towers are combined if their relative transverse momentum is less 
than a given measure. To illustrate the clustering process, consider a multi-parton final state. Initially 
each parton is considered as a proto-jet. The quantities 

kT,{i,j) = min(p|^i,p|^^.)-^ 

are computed for each proto-jet i and each pair of proto-jets ij, respectively. As earlier, p^ j is 
the transverse momentum of the z*'^ proto-jet and ARi j is the distance (in y, (p space, ARi j = 

\J iUi ~ Ujf + (0j ~ between each pair of proto-jets. D is the parameter that controls the size of 
the jet (analogous to Rcone)- If the smallest of the above quantities is a fef^^, then that proto-jet becomes 
a jet and is removed from the proto-jet list. If the smallest quantity is a j); then the two proto-jets 
(i, j) are merged into a single proto-jet by summing their four-vector components, and the two original 
entries in the proto-jet list are replaced by this single merged entry. This process is iterated with the 
corrected proto-jet list until all the proto-jets have become jets, i.e., at the last step the ^^(j j) for all 
pairs of proto-jets are larger than all for the proto-jets individually {i.e., the remaining proto-jets 
are well separated) and the latter all become jets. 

Note that in the pQCD NLO inclusive /c^ jet calculation, the parton pair with the smallest k"^ may 
or may not be combined into a single jet, depending on the k'^^ of the individual partons. Thus the final 
state can consist of either 2 or 3 jets, as was also the case for the cone algorithm. In fact, the pQCD 
NLO result for the inclusive kx jet cross section [18] suggests near equality with the cone jet cross section 
in the case that D ~ 1.35-Rcone (with no seeds, Rsep = 2). Thus the inclusive cone jet cross section with 
Rcone = 0.7 (-Rsep = 2) is Comparable in magnitude to the inclusive kx jet cross section with D = 0.9, 
at least at NLO. In the NLO language illustrated in Fig. Othe condition that the partons be merged 
in the kx algorithm is that z"^ {d? / D"^) < or d < D. Thus, at NLO, the kx algorithm corresponds 
to the cone algorithm with Rcone = D, R^ep = 1- The earlier result, D ~ 1.35-Rcone (with R^^p = 2), is 
just the NLO statement that the contribution of the rectangular region < d < 1.35Rcone, < z < 1 is 
numerically approximately equal to the contribution of the rectangular region < d < Rcone, < 2; < 1 
plus the (3 stable cone) triangular region Rcone ^ d < {1 + z) Rcone, < 2; < 1. 

In contrast to the cone case, the kx algorithm has no problems with overlapping jets and, less 
positively, every calorimeter tower is assigned to some jet. While this last result made some sense in 
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the e+e^ collider case, where every final state particle arose from the short-distance process, it is less 
obviously desirable in the hadron collider case. While the k-r algorithm tends to automatically correct 
for the splash-out effect by re-merging the energy distribution smeared by showering and hadronization 
into a single jet, this same feature leads to a tendency to enhance the splash-in effect by "vacuuming 
up" the contributions from the underlying event and including them in the large /cf^^ jets. This issue 
is exacerbated when the luminosity reaches the point that there is more than one collision per beam 
bunch crossing and pile-up is significant. This is now true at the Tevatron and will certainly be true 
eventually at the LHC. Thus while the (splash-out) fragmentation corrections for the kx algorithm are 
expected to be smaller than for cone algorithms, the (splash-in) underlying event corrections will be 
larger. A test of our understanding of these corrections will be provided by the comparison of the D 
and Rcone parameter values that yield comparable experimental jet cross sections. If we can reliably 
correct back to the fixed order perturbative level for both the cone and kx algorithms, we should see 
D ~ 1.35Rcone- Note that this result assumes that the cone jet cross section has been corrected to 
the value corresponding to Rgep = 2. On the other hand, under-corrected splash-in contributions in the 
kT algorithm will require D < 1.35-Rcone for comparable jet cross section values (still assuming that 
Rsep = 2 describes the cone results). If the cone algorithm jet cross section has under-corrected splash- 
out effects (-Rsep < 2), we expect that an even smaller ratio of D to Rcone will be required to obtain 
comparable jet cross sections (crudely we expect D < {1 + 0.35{Rsep — i))Rcone for 1 < Rsep < 2). As 
we will discuss below, the current studies at the Tevatron suggest that D < Rcone for comparable cross 
sections implying that indeed the kx algorithm is efficiently vacuuming up extra particles. However, 
once corrected back to the parton level, the /c-r algorithm cross section is smaller than the cone result 
for D = Rcone as expected. 

Another concern with the kx algorithm is the computer time needed to perform multiple evaluations 
of the list of pairs of proto-jets as 1 pair is merged with each pass, leading to a time that grows 
as A^^, where A^ is the number of initial proto-jets in the event. Recently [20] a faster version of 
the kx algorithm, "Fastjet", has been defined that recalculates only an intelligently chosen sub-list 
with each pass and the time grows only as A^lnA^, for large A^. The software in [20] also provides 
alternative versions of the kj- algorithm. As defined above, the algorithm is the inclusive version, 
keeping all possible jets defined by the parameter D. There is also an exclusive version [12] where 
another parameter dcut is introduced. Merging stops when all remaining k'^^^.-^ and k'^^ exceed dcut 
and the remaining /cf, • define the exclusively defined jets {i.e., the previously removed k'^^ < dcut jets 
are discarded). The Fastjet code also includes the so-called Cambridge/Aachen kx algorithm [2T][22] . 
the inclusive version of which is defined similarly to the algorithm above except that the prefactor 
min(p|. j,p|ij) is absent from ^^"^ ^Ti — 1- Finally the Fastjet code for the kx algorithm also 

includes an innovative technique for defining the "area" of the jet and allowing a correcting for the UE 
contribution. Fake or "ghost" particles with exponentially tiny energies are added to each event on an 
essentially uniform grid in (y, 0) {i.e., each ghost particle is representative of a fixed area in {y, 0)). The 
final jets found by the algorithm will then contain some of the ghost particles. While the kinematics 
of the jet is unchanged by the presence of the ghost particles (since they have such tiny energies), their 
number gives a measure of the area in (y, </>) of the jet. 

It should also be noted that, although it would appear that the inclusive kx algorithm is defined 
by a single parameter D, the suggested code for the kx algorithm [22] includes several switches and 
parameters to fully define the specific implementation of the algorithm. Thus, as is the case for the 
cone algorithm, the kx algorithm also exhibits opportunities for divergence between the implementation 
of the algorithm in the various experiments, and care should be taken to avoid this outcome. 



17 



3.3 Jet Masses for Jets at NLO 



As has already been suggested, the invariant masses of jets are expected to play an increasingly impor- 
tant role at the LHC as a useful jet property. For example, jet masses can help to isolate events where 
individual jets correspond to essentially all of the decay products of boosted heavy objects (say top 
quarks or W bosons). It is helpful to set the stage for these analyses by discussing first the expected 
magnitude of masses of jets arising from perturbative QCD interactions only. We will return to the 
question of jet masses in Section E] where showering, hadronization and UE effects will be included. The 
NLO configurations with 2 partons in a jet, corresponding to 2; > and d > 0, 1 cone — >1 jet and 3 
cones —>■ 1 jet in Fig. [6l will result in perturbative jets with nonzero masses. While the distribution of 
jet masses will be singular at the origin in perturbation theory, due to the soft and coUinear singular- 
ities, the average jet mass (squared) is an infrared safe quantity that can be evaluated order-by-order 
in perturbation theory. In particular, to find the average jet mass squared for a fixed jet pj at NLO 
we simply evaluate the NLO inclusive jet cross section at that fixed pj weighted by the corresponding 
jet mass squared, where the evaluation involves a sum over all 2 partons-in-a-jet configurations with 
the required pj . We then divide this sum by the corresponding Born level inclusive jet cross section. 
Note that the notation for the perturbative order used here corresponds to the order of the jet cross 
section. Thus at LO jets correspond to single partons with vanishing jet mass. As noted above, NLO 
jets receive contributions from 2 parton configurations with nonzero jet masses, with the masses pro- 
portional to a single power of a^. In detail this calculation is complicated due to the large number of 
perturbative processes that contribute and the fact that the available phase is restricted by the pdfs in 
a way that varies with pjl\fs. On the other hand, the expected general form of the perturbative jet 
mass is straightforward to motivate. By dimensional analysis the dominant contribution to the NLO 
jet mass squared will scale with pj, will scale with the algorithm defined "size" of the jet, i?^ or and 
exhibit a factor of a^. Choosing the factorization/renormalization scale in the running coupling to be 
/X = Pj/2 we are led to expect for a cone algorithm 

<*^J>™-c(^)"-(f)f5fl^ 

where the prefactor (prefunction) C has a magnitude of order unity and decreases slowly with increasing 
Pj/ \/s- Using the CTEQ 6.2 pdfs, averaging over jet rapidities in the range \yj\ < 2.5 and employing 
the EKS NLO inclusive jet code |T0] for the cone jet algorithm, we find the average NLO jet masses 
illustrated in the following figures. Due to the expected simple linear momentum dependence we will 
focus on the linear jet mass, \/{MThux) — V CusPjR. The simplest dimensionful NLO predictions for 
the jet mass at both Tevatron and LHC energies are illustrated in Fig. [TOl 

We see that for low momentum jets, pj ~ 100 GeV, the jet mass is relatively independent of the 
overall energy s. On the other hand, for substantially larger momenta the reduced phase space at the 
Tevatron leads to smaller predicted jet masses at the same momentum. We can simplify the discussion 
by first scaling out the (necessary dimensionful) linear dependence on pj as displayed in Fig. [TTl As 
suggested earlier we can largely correct for the overall energy difference by plotting instead versus the 
momentum fraction, xt = 2pj/y/s, as illustrated in Fig. [121 Now the impact of the differing beam 
energies (and beam flavor) is much reduced with scaled jet mass distributions of very similar shape, and 
the scaled LHC jet mass smaller than the scaled Tevatron jet mass at the same xt by approximately 10 
to 15%. The common falling shape of the distributions as functions of xt can be understood as arising 
from the falling coupling, the falling pdfs and the transition from dominantly gluons at small xt with 
C oc Ca = 3 to dominantly quarks (and anti-quarks) at large xt with C oc Cp = 4:/3. The difference in 
the magnitude of the two distributions arises from the scale breaking in the theory {pj is larger at the 
LHC at the same xt) with both the running coupling and the running pdfs being smaller at the LHC. 
We can verify the approximately linear dependence on the cone radius R by both varying the radius 
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Figure 10: NLO jet masses for a cone jet with R = 0.7 and Rsep = 1-3 with Tevatron and LHC energies 
and beams. 
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Figure 11: NLO jet mass for a cone jet with R = 0.7 and Rgep = 1.3 scaled by the jet momentum pj 
with Tevatron and LHC energies and beams. 
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Figure 12: NLO jet mass for a cone jet with i? = 0.7 and Rsep = 1-3 scaled to the jet momentum pj 
plotted versus xt = Ipjj \fs with Tevatron and LHC energies and beams. 



and scaling it out as in Fig{T3] for LHC beam energies. 

While the dependences on pj and R are not exactly linear, the NLO jet mass is remarkably well 
described by the simple rule-of-thumb 

where the numerical prefactor of 0.2 (approximately) includes the dependence on ^^aj, the color 
charges (a mix of Ci? and Ca) and the pdfs. For better than 25% accuracy the still simple pref- 
actor (0.2 + (0.3 — Xt) /6) suffices. 

Finally consider the dependence on the ad hoc parameter Rsep- This is illustrated in Fig. [141 We see 
the expected monotonic dependence on Rsep^ where the variation is somewhat less rapid than linear. 
In the language introduced earlier we see that the mass for a naive Snowmass cone jet [Rsep = 2.0) 
is expected (in NLO perturbation theory) to be approximately 30 to 50% larger than the mass of a 
kr jet [Rsep = 1-0) with D = R. Qualitatively we anticipate that, compared to NLO perturbation 
theory, the inclusion of showering, hadronization and the underlying event will lead to "splash-out" 
effects and smaller jet masses for the cone algorithm, but "splash-in" and larger jet masses for the 
algorithm. Hence the masses of jets found in realistic environments by different algorithms, as discussed 
in Section EJ, are expected to be more similar than suggested by NLO perturbation theory and Fig. [TH 

In summary, perturbative QCD (alone) leads us to expect jet masses at the LHC that grow slightly 
more slowly than linearly with the jet momentum, scale linearly with the jet algorithm defined jet size, 
R or D, and have a magnitude approximately (±25%) described by \/^JMj)nlo ~ 0.2pjR. The explicit 
prefactor (0.2) corresponds to Rsep = 1-3 cone jets with Snowmass jets {Rsep = 2.0) having masses 
larger by 10% and NLO kr jets {Rsep = 1-0) having masses smaller by 20%. 
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Figure 13: NLO jet mass for a cone jet with Rs^p = 1.3, ^/s = 14 TeV scaled by the jet momentum pj 
and the cone radius R plotted versus xt = '2^p.j / \/s for various values of the cone radius R. 

3.4 Recent Cone Algorithm Issues 

3.4.1 Jets at the "Smeared" Parton Level and Dark Towers 

In studies of the Run II iterative cone algorithms a previously unnoticed problem has been identified [23] 
at the particle and calorimeter level, which is explicitly not present at the NLO parton level. It is 
observed in a (relatively small) fraction of the events that some energetic particles/calorimeter towers 
remain unclustered in any jet. This effect is understood to arise in configurations of two nearby {i.e., 
nearby on the scale of the cone size) showers, where one shower is of substantially larger energy. Any 
trial cone at the location of the lower energy shower will include contributions from the larger energy 
shower, and the resulting centroid will migrate towards the larger energy peak. This feature is labeled 
"dark towers" in Ref. [23], i. e., clusters that have a transverse momentum large enough to be designated 
either a separate jet or to be included in an existing nearby jet, but which are not clustered into either. 
A Monte Carlo event with this structure is shown in Fig. [T51 where the towers unclustered into any jet 
are shaded black. 

A simple way of understanding the dark towers can be motivated by returning to Fig. [71 where the 
only smearing in {y, 0) between the localized energy distribution of panel a) (the individual partons) 
and the "potential" of panel b) arises from the size of the cone itself. On the other hand, we know that 
showering and hadronization will lead to further smearing of the energy distribution and thus of the 
potential. Sketched in Fig. (TH] is the potential (and the energy-in-cone) distributions that results from 
Gaussian smearing with a width of a) a = 0.1 and b) cr = 0.25 (in the same angular units as i? = 0.7). 
Note that the smearing of the potential occurs in 2 dimensions and that here we are considering only a 1 
dimensional slice. In both panels, as in Fig. [TJ the partons have pt ratio z = 0.6 and angular separation 
d = 1.0. Note that as the smearing increases from zero as in panel b) of Fig. [Tjto the smeared results 
in Fig. [ini we first lose the (not so deep) minimum corresponding to the midpoint stable cone (and jet), 
providing another piece of the explanation for why showers more than 1.3 ■ Rcone apart are not observed 
to merge by the experiments. In panel b), with even more smearing, the minimum in the potential 
near the shower from the lower energy parton also vanishes, meaning this (lower energy) shower is part 
of no stable cone or jet, i.e., leading to dark towers. Any attempt to place the center of a trial cone at 
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the position of the right parton will result in the centroid "flowing" to the position of the left parton 
and the energy corresponding to the right parton remaining unclustered in any jet. Note that the Run 
I CDF algorithm, JETCLU with ratcheting, limited the role of dark towers by never allowing a trial 
cone to leave the seed towers, the potential dark towers, behind. The effective smearing in the data is 
expected to lie between a values of 0.1 and 0.25 (with shower-to-shower fluctuations and some energy 
dependence, being larger for smaller "px jets) making this discussion relevant, but this question awaits 
further studies. Note that Fig. [16] also suggests that the Midpoint algorithm will not entirely fix the 
original issue of missing merged jets. Due to the presence of (real) smearing this middle cone is often 
unstable and the merged jet configuration will not be found even though we explicitly look for it with 
the Midpoint cone. Thus, even using the recommended Midpoint algorithm (with seeds), there may 
remain a phenomenological need for the parameter value Rse%, < 2, i.e., a continuing mismatch between 
data and pQCD that requires correction. 



3.4.2 The Search Cone Algorithm 

A potential solution for the dark towers problem is described in Ref. [23]. The idea is to decouple 
the jet finding step from the jet construction step. In particular, the stable cone finding procedure 
is performed with a cone of radius half that of the final jet radius, i.e., the radius of the search cone, 
Rsearch = Rcone/'^- This proccdurc rcduccs the smearing in Figs. [7| and [TBI cind reduces the phase 
space for configurations that lead to dark towers (and missing merged jets). This point is illustrated in 
Fig- [13 which shows the potential of Fig. [161 panel b) corresponding to the reduced radius search cone. 
Note, in particular, that there is again a minimum at the location of the second parton. Seeds placed at 
each parton will yield a stable cone at each location even after the smearing. Using the smaller search 
cone size means there is less influence from the (smeared) energy of nearby partons. After identifying 
the locations of stable cones, the larger cone size, e.g., Rjet = Rcone = 0.7, is used to sum all objects 
inside and construct the energy and momentum of the jet (with no iteration). All pairs of stable cones 
separated by less than 2Rcone are then used to define midpoint seeds as in the usual Midpoint cone 
algorithm. A trial cone of size Rcone is placed at each such seed and iterated to test for stability. (Note 
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Figure 15: An example of a Monte Carlo inclusive jet event where the Midpoint algorithm has left 
substantial energy unclustered. 




Figure 16: Energy-in-cone and potential distributions corresponding to Gaussian smearing with a) 
a = 0.1 and b) a = 0.25 for d = 1.0 and z = 0.6. 
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Figure 17: The stable cone finding potential with the reduced search cone, Rsearch = Rconef^- The 
original potential from Fig. [TBI panel b) with Rsearch = Rcone is indicated as the dashed curve. 



that this Midpoint cone is iterated with cone size Rcone, not the smaller Rsearch, contrary to what is 
described in the literature [25] •) Thus, just as in the Midpoint cone algorithm, stable midpoint cones 
will be found by the CDF Search cone algorithm. However, as already discussed, we expect that there 
will be no stable midpoint cone due to the smearing. Note that, even with the reduced smearing when 
using the smaller search cone radius, there is still no central stable cone in the potential of Fig. [T71 On 
the other hand, as applied to NLO perturbation theory without smearing, the Search cone algorithm 
should act like the usual Midpoint cone algorithm and yield the naive result of Fig. [6] a). The net 
impact of adding the step with the smaller initial search cone as applied to data is an approximately 
5% increase in the inclusive jet cross section. In fact, as applied to data the Search cone algorithm 
identifies so many more stable cones, that the CDF collaboration has decided to use the Search cone 
algorithm with the merging parameter fmerge = 0.75 (instead of 0.5) to limit the level of merging. 

Unfortunately, a disturbing feature of the Search Cone Algorithm arises when it is applied at higher 
orders in perturbation theory as was pointed out during the TeV4LHC Workshop. At NNLO in pertur- 
bation theory, the Search cone algorithm can act much like the seeds discussed earlier. In particular, 
the Search cone algorithm can identify a (small radius Rsearch) stable (soft) cone between two energetic 
cones arising from the soft gluon between 2 energetic partons configuration discussed earlier. The soft 
search cone is stable exactly because it "fits" between the two energetic partons without including either; 
the spacing between the two energetic partons can be in the range 2Rsearch = Rcone < Ai? < 2Rcone- 
Then, when the radius of the (stable, soft) search cone is increased to Rcone, the resulting full size 
cone will envelop, and serve to merge, the two energetic partons. This can occur even when the two 
energetic partons do not constitute a stable combined cone in the standard cone algorithm. Thus at 
NNLO, the Search cone algorithm can exhibit an IR-sensitivity very similar to, and just as undesirable 
as, the seed-induced problem discussed earlier. The conclusion is that the Search cone algorithm, while 
it does address the dark tower issue, creates its own set of issues and is not considered to be a real 
solution of the dark tower problem. 
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3.4.3 The Midpoint Cone Algorithm with a "Second Pass" 



The recommendation adopted in the TeV4LHC workshop, and which we also endorse, is to remove 
the Search Cone from the Midpoint cone algorithm and to substitute a "second-pass" step. After all 
stable cones are found in an event, the towers corresponding to those jets are removed from the list, and 
the Midpoint cone algorithm is run again. The "dark towers" are then reconstructed as jets in their 
own right, as the attractive influence of nearby larger jets is no longer present. The question remains 
as to what to do with the second-pass jets. According to the criteria discussed above, some of them 
should be merged with the larger nearby jet, while others should remain as separate jets. As separate 
jets, the second-pass jets have an insignificant impact on inclusive jet cross sections. If added to the 
nearby larger jets, they can have an effect of the order of up to 5%. The correct treatment is still under 
investigation. Note that the second-pass jets may also be important in the accurate reconstruction of 
a complicated multi-jet final state, such as in ti events. 

It is also important to keep in mind that the experimental reality in which jet algorithms are applied 
is more complex than the 1-D slice shown in Fig. [111 The final-state hadrons are distributed in two 
dimensions {y and 0) and the resultant Snowmass potential has fluctuations in these two dimensions. 
This has implications regarding the presence or not of dark towers and of the effect of the distribution 
of the jet energies on the stability of cone centroids. 

3.4.4 Summary 

In summary, to compare fixed order perturbation theory with data there must be corrections for detector 
effects, for the splash-in contributions of the underlying event (and pile-up) and for the splash-out effects 
of hadronization (and for showering when comparing to leading order calculations). It is the response 
to both the splash-in and splash-out effects that distinguishes the various cone algorithms and drives the 
issues we have just been discussing. It is also important to recognize that the splash-in and splash-out 
effects (and corrections) come with opposite signs and there can be substantial cancellation. We will 
return to the question of Run II corrections below. The conclusion from the previous discussion is 
that it would be very helpful to include also a correction in the experimental analysis that accounts for 
the use of seeds, or to use a seedless cone algorithm such as SISCone [IE]. Then these experimental 
results could be compared to perturbative results without seeds, avoiding the inherent infrared problems 
caused by seeds in perturbative analyses. At the same time, the analysis described above suggests that 
using the Midpoint cone algorithm, to remove the impact of seeds at NLO , does not fully eliminate the 
impact of the smearing due to showering and hadronization, which serves to render the Midpoint cone 
of fixed order perturbation theory unstable. The same may be true, but possibly to a lesser extent, with 
the use of a seedless algorithm such as SISCone, which increases the number of identified stable cones. 
Thus we should still not expect to be able to compare data to NLO theory with Rsep = 2, although 
the impact may be somewhat reduced. The possible downside of finding more stable cones is that the 
split /merge step will play a larger role, with no analogue effect in the perturbative analysis. Splitting 
the calorimeter towers with a host of low Ex stable cones can serve to lower the of a leading jet and 
introduce an undesirable dependence on the minimum cut defining which stable cones are included 
in the split/merge process. This is surely only a percent level correction, but control at that level is 
the ultimate goal. 

3.5 Jets at the Hadron Level 

The last step before the detector in our factorized, approximate picture of a hard scattering event 
is the hadronization step. Here the colored partons arising from the hard scattering itself, from 
ISR and FSR and from the UE are organized into color singlet hadrons. Thus this step requires 
information about the color flow within the event as the event evolves from short to long distance, and 
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that information must be maintained during the showering process. The initial color single hadrons will 
include resonances, which are then decayed into the long lifetime, long distance final hadrons (primarily 
pions and kaons) according to the known properties of the resonances. Essentially by definition, this 
step represents non-perturbative physics, is model dependent and therefore not well understood. On 
the other hand, by construction, it looks for minimal mass color singlet final states while conserving 
energy and momentum and so results in only a minimal kinematic rearrangement. Thus, although 
there is considerable uncertainty about the details of hadronization, the impact on jet reconstruction 
is small and is fairly easily corrected for in both types of algorithms except perhaps at the lowest jet 
energies. In experimental analyses, these corrections are estimated using leading-order parton-shower 
Monte Carlo generators, by observing the variation of the predicted jet cross sections after turning off 
the interaction between beam remnants and the hadronization. This procedure relies on the Monte 
Carlo providing a good description of those observables in the data that are most sensitive to non- 
perturbative contributions such as, for example, the underlying event energy away from jets and the 
internal structure of the jets. Recent precise measurements on jet shapes [2^], as indicated in Fig. [THl 
have allowed the detailed study of the models employed to describe the underlying event in inclusive jet 
production at the Tevatron (see also Ref. [26]). Besides the contribution from the UE the jet shape is 
also sensitive to the changing character of the scattered parton, i.e., gluons at low pj- and quarks at high 
Pt, and to the perturbative scaling of the jet size with (pr)- The Tevatron studies, which will be 
discussed in more detail below, suggest that these last effects are reasonably well described both by the 
showers in Monte Carlo simulations and by pQCD analyses. Future measurements of the underlying 
event in Run II, for different hadronic final states, promise to play a major role in the understanding 
of the measured jet cross sections at the LHC. 
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Figure 18: The fraction of pt in a cone jet of radius 0.7 that lies in the annulus from 0.3 to 0.7 as a 
function of jet px- The CDF Run II measurement is compared to predictions from Herwig and Pythia 
with different set of parameters (left), and to the separate predictions for quark and gluon jets (right). 
Figures from Ref. [25] . 



Overall the challenge for the future, as noted above, is to continue to reduce the systematic un- 
certainties below the current 10% to 50% level, which effectively guarantees agreement with theory. 
Indeed, Run II studies of the corrections due to splash-in, i.e., the underlying event (and pile- up), 
and the splash-out corrections due to hadronization are much more sophisticated than in Run I and 
presented in such a way that they can be applied either to the data (corrected for the detector) or to 
a theoretical (perturbative) calculation. The evaluation of these corrections is based on data and the 
standard Monte Carlos, Pythia and Herwig, especially Tune A of Pythia, which seems to accurately 
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simulate the underlying event in terms of multiple parton interactions, including the observed correla- 
tions with the hard scattering process [SB]. We turn now to a detailed discussion of the experiences 
and lessoned learned at the Tevatron. 

4 Jets at the Tevatron 

Most of the interesting physics processes in pp collisions at the Tevatron include jets in the final state. 
Therefore, the reconstruction and precise measurement of jets serve as key ingredients for the better 
understanding of hard scattering processes and for new physics searches at the Tevatron. 

As an example, consider top quark production. The top quark was discovered by the CDF and 
D0 experiments in 1995 [271 EHl EHj , and since then studies of the top quark production and properties 
have been considered to be one of the primary physics goals for the Tevatron experiments. Top quark 
decays always have at least one jet in the final state and the systematic uncertainty of the top quark 
mass measurement is dominated by the jet energy scale uncertainty. The search for the Standard 
Model Higgs boson is also an important aspect of the Tevatron physics program. The Standard Model 
Higgs boson dominantly decays to bb for masses mn ^ 135 GeV/ c. The bb dijet mass resolution 
has been stressed as one of the critical factors in the search for a light Higgs boson decay at the 
Tevatron [301 EI]; a 20% decrease of am/'m^i is expected to be have a similar effect as a 20% increase 
in the accumulated luminosity. Inclusive jet cross section measurements have also been undertaken 
at the Tevatron. Such measurements are sensitive to new physics such as quark compositeness and 
the presence of new heavy particles from Beyond-the-Standard-Model scenarios, and also they provide 
crucial information on parton distribution functions for the proton. 

4.1 Detectors 

CDF and D0 are the two general-purpose detectors designed for measurement of pp collisions at the 
Fermilab Tevatron collider. Both detectors are composed of a solenoidal-magnet charged particle spec- 
trometer surrounded by (sampling) calorimetry and a system of muon chambers. The components 
most relevant for jets are the calorimeters, which are used to measure the energy and angle of particles 
produced in pp collisions. The CDF and D0 calorimeters both have a projective tower geometry as 
shown in Fig. [111 

The D0 calorimeters are uranium and liquid argon sampling calorimeters. The D0 calorimetry 
consists of a central section covering |?7| ^ 1 and an endcap section extending the coverage to \ri\ ~ 4. 
The calorimetry has three sections: EM, fine hadronic (FH) and coarse hadronic (CH). In the CH 
section, copper or steel is used instead of uranium as an absorber. The D0 calorimeters are nearly 
compensating, with an e/vr ratio less than 1.05 above 30 GeV. The tower segmentation in rj-cf) space 
is 0.1 X 5.625°. The average noise per channel is on the order of 50 MeV. The relatively long charge 
collection time for the liquid argon signal and the resultant electronic shaping of the signal results in 
the possibility of towers with (modest) negative energy. 

The CDF electromagnetic (EM) section consists of alternating layers of lead and scintillator, while 
the hadronic (HAD) section consists of alternating layers of steel and scintillator. The CDF calorimetry 
is divided into two main pseudorapidity (|?7|) regions; the central calorimeter covers |?7| < 1.1 and the 
plug region covers 1.1 < |?7| < 3.6. The region between the central and plug regions is covered by the 
end-wall hadron calorimeter which has a similar construction as the central hadron calorimeter. The 
tower size in the central region is Ar/ x A0 ^ 0.1 x 15°, and the segmentation in the plug region varies 
as a function of t] (Ar/ x A0 0.1 x 7.5° for \ri\ < 1.8 and with Arj increasing for larger |?7| values). 
The noise level is very low, with only ~ 1 noise tower with Et > 50 MeV being present per event. The 
Et threshold on each calorimeter tower used for jet reconstruction is 100 MeV, and no special care is 
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necessary for noise suppression. 



4.2 Jet Reconstruction and Energy Measurement 

In contrast to leptons or photons, jets are clusters of "objects" that have to be defined by a clustering 
algorithm, as discussed previously in this paper. In parton shower Monte Carlo events, a jet clustering 
algorithm can be applied to particles produced after hadronization of the partons, producing hadron- 
level jets; for pQCD parton-level predictions, jets are formed by running the algorithm on partons from 
the fixed-order (typically next-to-leading order) pQCD event generator. Experimentally, it is extremely 
challenging to both identify and measure the kinematic properties of the large variety of individual 
particles produced in high-energy hadronic collisions, especially with the relatively coarse calorimeter 
segmentation present in the Tevatron experiments. As a result jets are reconstructed by running the 
jet algorithm on the "raw" energy depositions in the calorimeter towers, and not on the individual 
particles. 

The reconstruction and energy measurement of these "calorimeter" jets are affected by a variety 
of instrumental and physics effects. The instrumental effects include (1) calorimeter non- uniformity, 
(2) resolution effects due to large fluctuations in particle showering in the calorimeter, (3) non-linear 
response of the calorimeter, especially to hadrons, and (4) low momentum particles not reaching the 
calorimeter due to materials in front of the calorimeter and the solenoidal magnetic field. Energy from 
additional pp interactions occurring in the same bunch crossing also affects the jet reconstruction and 
energy measurement. This has a non-negligible effect especially for low transverse momentum jets 
and high instantaneous luminosities. The situation is further complicated due to the fact that (a) the 
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underlying event contributes energy to the jet cone, and that (b) the jet cone does not contain all the 
energy of the parent parton because of the effects of parton showering and hadronization. All of these 
effects have to be taken into account when comparing experimental data with theoretical predictions 
or extracting physics quantities of interest from experimental data. 

Analyses take these physics effects into account in different manners. For example, in new physics 
searches or in measurements of top quark properties, attempts are often made to correct the jets 
measured in the calorimeters back to the parent parton from which the jet originated. Extraction of 
the physics quantities of interest is done by comparing the data, after background subtraction, with 
signal events implemented in the parton shower Monte Carlo. 

Typically in jet cross section measurements, the data are corrected to the hadron- level {i.e., cor- 
rected only for instrumental effects); at this level, the results can be compared with hadron- level theo- 
retical predictions without any knowledge of the detectors. We strongly encourage that, where possible, 
measurements at the Tevatron or LHC produce results at this level; often, instead the predictions from 
Monte Carlos are themselves passed through a detector simulation program and then directly compared 
to the "raw" data distributions. 

Sometimes, the theoretical predictions from perturbative QCD calculations are available only at the 
parton level and then it is necessary to either correct the theory to the hadron level or the data to 
the parton level. For inclusive jet cross sections, the best theoretical predictions available, as of now, 
are from next-to- leading order perturbative QCD calculations [lOl [SU [35] . When making comparisons 
between experimental measurements and next-to-leading (NLO) order predictions, non-perturbative 
QCD effects from the underlying event and hadronization (see Section 14.31 for details) are evaluated 
based on tuned parton shower Monte Carlos. Note that a correction is not made for "out-of-cone" 
energy from hard gluon emission, as such effects are already (at least partially) taken into account in 
the NLO calculations. In these comparisons, next-to-leading order hard gluon emission is assumed to 
model the whole parton shower perturbative QCD radiation process. NLO pQCD calculations have 
been shown to provide a reasonable description of energy flow inside jets |3B], so this is considered to 
be a reasonable assumptior0; however, data-theory comparisons will benefit a great deal from higher 
order QCD calculations or next-to-leading matrix element calculations interfaced with parton shower 
models, as in e.g., MC@NLO [37]. 

The jet algorithms and procedures for jet energy scale correction to the hadron-level {i.e., correction 
for instrumental effects) employed by CDF and D0 are discussed below. 



4.2.1 Jet Algorithms 

In Run II, D0 uses the Midpoint iterative cone algorithm with the cone radius Rcone = 0.5 and 0.7 
and merging criterion (see Section [3. 1.41) f merge = 0.50. At CDF, several jet algorithms are in use. In a 
large fraction of analyses, the JETCLU cone clustering has been used, which was used also in Run I at 
CDF, with the cone radius Rcone = 0.4 and 0.7 and the merging fraction f merge = 0.75. The Midpoint 
algorithm with the search cone step, i.e., the Search cone algorithm (see Section r3.4.2p . with the cone 
radius of Rcone = 0.7 and the merging fraction of f merge = 0.75 was also used in several jet cross section 
measurements; however, because of the IR- sensitivity introduced by the search cone step, the search 
cone step is being removed from the clustering. The algorithm was also used in the inclusive jet 
cross section measurements with D = 0.5, 0.7 and 1.0. 



■^As r ^ inside a jet, however, the energy profile function develops large logarithms of the form a" log^"~^ r; for 
an accurate description of this region, these terms would have to be re-summed. The pure NLO description should be 
accurate for r values greater than 0.3. 
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4.2.2 Jet Energy Scale at D0 



The jet energy calibration procedure employed by D0 is based primarily on data, exploiting the conser- 
vation of transverse momentum [38]. The measured jet energy is corrected back to the true hadron-level 
jet energy by: 

rpmeasured rp 

j^hadron _ jet ^ ^ 



jet R Q ' 

where Eq is an offset energy which includes the uranium noise, energy from the previous bunch crossing, 
and additional pp interactions. The offset Eq is determined by measuring the transverse energy density 
in the minimum-bias and zero-bias data. Rjet represents the calorimeter response to jets. This corrects 
both for the calorimeter non- uniformity in rj and for the absolute energy scale. S is the showering 
correction, which corrects for the energy emitted outside the jet cone due to detector effects. 

D0 uses the missing E^ projection fraction (MPF) method [38], which exploits the transverse 
momentum conservation in an event, to measure the calorimeter response to jets {Rjet)- In photon+jet 
events, for example, the transverse energies of the real photon and the other recoil particles at the 
hadron level should satisfy: 

+ = 0. (2) 

In general, the calorimeter response to both photons and recoils is less than unity and the energy 
mis-measurement results in missing Et (-^t) in events: 

I? Z?T I I? jprecoil 77! fQ\ 

n^Hirp -\- JXrecoilJ-'T —~JpT- I, -J J 

After the EM energy calibration, i?^ = 1, and Eqs. ([2]), ([3]) transform to: 



Rrecoil f j-17 ; (4) 



E^ 



where = Ep/\Ep\. In back-to-back photon+jet events, Rrecou can be considered as the response of a 
jet (Rjet)- The absolute jet response correction is determined and applied after the response is equalized 
as a function of rj. Fig. [20r a) shows the jet energy scale systematic uncertainty achieved for central jets 
as a function of jet energy at D0. 



4.2.3 Jet Energy Scale at CDF 

At CDF, the determination of the jet energy absolute (response) correction at the hadron-level relies 
primarily on a detector simulation and jet fragmentation model [33]. Therefore, the accuracy of the 
calorimeter simulation is crucial for the precise jet energy scale determination. The CDF calorimeter 
simulation response to single particles is tuned to reproduce the response measured both in the test beam 
and in the collision data taken with the real detector. The CDF detector simulation uses a Geant-based 
detector simulation in which a parametrized shower simulation (Gflash [39]) is used for the calorimeter 
response. The use of Gfiash was primarily motivated by its excellent performance in terms of speed, but 
also the relative ease of tuning. After tuning, the absolute correction for calorimeter-level jets to the 
corresponding hadron-level jets is obtained on average from dijet Monte Carlo events by matching the 
two leading hadron-level jets to the corresponding calorimeter jets, and then taking the most probable 
value of calorimeter-jet pt for hadron-level jets with a given p^^^/eT"- 

This method can be used in the entire calorimeter coverage region; however, in practice the tuning is 
limited in precision outside the central region (0.2 < |?7| < 0.6) because of the limited tracking coverage. 
Thus, the jet energy scale outside the central region is rescaled to that of the central region based on 
the Pt balance of the leading two jets in exclusive {i.e., stringent cuts are placed on the presence of 
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Figure 20: The jet energy scale systematic uncertainties for (a) central jets at r/ = as a function of 
uncorrected jet energy at D0 and (b) central jets in 0.2 < |?7| < 0.6 as a function of corrected jet pr at 
CDF. Figures from Refs. [32] and 



any additional jets in the event) dijet events; in such dijet events, two jets should have the same pt 
to leading order and any imbalance is due to calorimeter non-uniformity. This correction is called the 
relative correction. 

Additional pp interactions, in the same bunch crossing as the interaction which produced the jets, 
also contribute energy in the jet cone. The number of reconstructed primary vertices (N^tx) is a good 
estimator of the number of interactions in the same bunch crossing. For jets reconstructed with the cone 
algorithm, this multiple interaction correction, p^^ , is derived by measuring the transverse momentum 
in a cone of the same size as the jet cone in the central region as a function of N^tx in minimum bias 
events. For a cone size Rcone = 0.7, the correction subtracts 1 GeV/ c for each additional primary 

vertex. 

The CDF jet energy corrections described above can be summarized as, 

Prtr = [P~r' X - p^' X (AT,,, - 1)] X U,. (5) 

where frei is the relative jet energy correction factor as a function of jet 77 and pt and fabs is the absolute 
correction factor as a function of jet p^- 

At CDF, the photon-jet and Z-]et pt balances are used to cross-check the jet energy scale in data and 
Monte Carlo events. When photon-jet and Z-]ei balances are formed, tight cuts are made on the second 
jet Pt and between the photon/Z and jet, pt,2 < 3 GeV/c and Ac/) > 3 (rad), to suppress gluon 
radiation in the events that may affect the pt balance [33] . With the cuts discussed above, the Pythia 
and Herwig event simulation show differences in px balance at the level of ~ 2 — 3%, a disagreement 
which is not well understood yet. A better understanding of this difference will be extremely useful 
for future improvements of the jet energy scale uncertainty. The overall jet energy scale systematic 
uncertainty, evaluated as described above for central jets at CDF, is shown in Fig. [20r b): it is ~ 3% at 
high Pt and increases at pt < 50 GeV. 

The hadronic decays of resonances with well known masses, such as the W and Z bosons, can also 
be useful to test and to calibrate the jet energy scale. In most cases, the hadronic W and Z decays are 
swamped by QCD background at hadron colliders; however in, e.g., ti — > W{—^ > 4 jets events 
(referred to as lepton-|-jets events/channel hereafter), the hadronic W resonance can be observed with 
only a relatively small QCD background. 
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Figure 21: Dijet mass distributions for four subsamples in the lepton+jets sample. The signal and 
background shapes corresponding to the best fit of the jet energy scale cross-check are overlaid on the 
histograms. Clear W peaks can be observed, especially when one or more 6-tags is required. Figure 
from Ref. [33] . 



Hadronic W decays in tt events were first observed in Run I in lepton+jets events in Run I |40j . 
and in Run II these decays have been used successfully by both the CDF and D0 collaborations to 
finely calibrate the jet energy scale in top quark mass measurements in the lepton+jets channel |4Tll42]. 
Fig. EH shows the dijet mass distributions in four sub-samples of the lepton+jet event sample; the sub- 
samples have been divided based on the number of jets identified as 6-quark jets by the standard CDF 
6-tagging algorithm (see Section 1^61) . Events with one 6-tagged jet are further divided into two classes; 
l-tag(T) refers to events with four jets with Et > 15 GeV and l-tag(L) refers to events with three 
jets with Et > 15 GeV and the fourth jet with 8 < Et < 15 GeV. In the CDF measurement [H], the 
reconstructed top mass and dijet mass distributions are formed from tt MC events with various top mass 
and jet energy scales ranging from -3 to +3a where a is the total jet energy scale uncertainty described 
above. Fits to the data without using the jet energy scale constraint from the standard procedure 
yield the jet energy scale [—0.25 ± 1.22](j, indicating that the jet energy scale from the aforementioned 
procedure is in good agreement with information provided by the W resonance peak in tt events. 

Hadronic W/Z decays in the W/Z+'-f process have been also investigated by CDF. The observation of 
this signal can provide another way in which to validate and/or constrain the jet energy scale; however, 
the extremely large QCD background has made the observation of such a signal extremely difficult. 
Constraining the jet energy scale with the hadronic W resonance is a very powerful technique, and the 
jet energy scale uncertainty from this method will improve as more data are accumulated. However, it 
has to be noted that all the detailed studies presented above are crucial for the success of this technique, 
since this method relies on a good modeling of the dijet mass distribution. Also, this technique is not 
able to constrain the jet energy scale over a wide range of jet pt- 

4.2.4 Summary 

In this section (Sec. 14.21) . the strategies used by the D0 and CDF collaborations to determine the jet 
energy scale were presented. The two collaborations used the different approaches, but achieved to 
determine the jet energy scale with similar precision. It is worth noting that, although large groups of 
people worked on the jet energy scale determination in both experiments, it still took more than three 
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years to achieve the precision shown in Fig. [20] from the beginning of the Run II. High quality test beam 
analyses, thorough detector simulation tuning based on the test beam results, good planning for the jet 
energy calibration and validation based on the in-situ data (including the trigger implementation for 
the calibration/validation datasets) will be essential for the early physics analyses at the LHC 

4.3 Monte Carlo Tuning 

Most of the analyses at hadron colliders use parton shower Monte Carlos in order to model the signal and 
background events, unfold the detector effects and to extract the physics quantities of interest. These 
parton shower Monte Carlo programs have numbers of parameters that need to be tuned on the real 
data, e.g., parameters that control the initial and final state radiation, jet fragmentation, hadronization, 
and underlying event. Well-tuned Monte Carlos are essential for a precise measurement and proper 
comparison with theoretical predictions. The good modeling of jet fragmentation properties and of the 
underlying event in Monte Carlo events is crucial for corrections of jet energies from the calorimeter level 
to the hadron level. Also, in recent inclusive jet cross section measurments by CDF [l3l HH HH |16] the 
effects of underlying event and hadronization are estimated using leading-order parton-shower Monte 
Carlo generators based on the variation of the predicted jet cross sections after turning off the interaction 
between beam remnants and the hadronization in Monte Carlo events. Several measurements performed 
at the Tevatron, which are used for either tuning Monte Carlos or validating the tunings, are discussed 
below. 

4.3.1 Dijet Angular Decorrelations 

D0 made a measurement of the azimuthal angle between the two leading jets, A0, in Run II [17]. This 
provides an excellent testing ground for the study of multiple gluon radiation effects. Studies [HI [IZ] 
have shown that such distributions are not sensitive to underlying event and hadronization effects. 
Near the A0 ~ tt peak the distribution is sensitive to soft radiation with small px, and the tail at small 
A0 is sensitive to hard radiation with high transverse momentum. The measured A0 distributions, 
{l/ajj){dajj/dA(j)), for four different (largest jet px in an event) ranges are shown in Fig. [2^ along 
with the predictions from Herwig and Pythia. Jets are reconstructed with the Midpoint cone algorithm 
with a cone radius of Rcone = 0.7. The measured distributions have a sharp peak at A0 ~ tt and the 
peaks are sharper at higher jet pt mainly due to the running of asipr)- The default Herwig predictions 
give a reasonable description of the data over the whole A(j) range in all p^"-^ regions; however, the 
default Pythia gives sharper distributions than data in all p^"^ regions, and Pythia provides a better 
description of the data when ISR is enhanced (see Fig. [^2]) . The predictions are found to be insensitive 
to FSR tunes, and the measurement provides a good tool for ISR tuning. Monte Carlo predictions from 
Sherpa [H] and Alpgen [l9l [50] have also been tested against this data, as have NLO pQCD predictions 
from NLOJET++ [35]. They were found to give a reasonable description of the data. 

4.3.2 Underlying Event 

Many of the important observables at a hadron collider, including jets, are sensitive to the underlying 
event, and so a good understanding of the underlying event is needed for precision measurements. A 
series of studies have been made on underlying events by CDF in Run I [2|, [51] and Run II [6l [52] . 
These studies made use of the topological structure of hadron-hadron collisions to study the underlying 
event; measurements were made on event activities in the "transverse" region with respect to the jet 
axis in jet events; this region is most sensitive to the underlying event. The geometry of one study is 
shown in Fig. [23], where the "transverse" , "towards" and "away" regions have been defined with respect 
to the direction of the leading jet. The TransMAX (TransMIN) region refers to the transverse region 
containing the highest (lowest) scalar pt sum of charged particles. The study made use of two classes 
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Figure 22: A</) distributions in different p™"^ ranges over A(/) > 7r/2 (a) and in the pealc region A(/) > 
137r/16 (b). Predictions from Herwig and Pythia are overlaid for comparison. Pjd;liia predictions are 
sliown for the PARP(67) parameter varied between 1.0 and 4.0. Figures from Refs. [6l W7\. 



of events, (1) "leading jet" event, in which there is no restriction on the second and third jet, and (2) 
"back-to-back" events, in which there are at least two jets with pt > 15 GeV, the leading two jets are 
nearly back-to-back (A0(jetl,2) > 150°) with prii et2) / prii etl) > 0.8, and PrijetS) < 15 GeV/c. 

The density of transverse momentum carried by charged particles, dpx / drjdcf), in both the TransMAX 
and TransMIN regions in the leading jet events and back-to-back events is shown in Fig. [2H In the 
"back-to-back" events, contributions from hard components (initial and final state radiation) to the 
"transverse" region are suppressed, and the sensitivity to the underlying event is increased. In the 
study, the TransMAX and TransMIN regions were also used in order to better separate the hard 
components from the underlying event. 

We expect that the MAX region will pick up the hardest ISR or FSR, and thus the MIN region will 
be more sensitive to the underlying event, which is indicated in Fig. [2H dpT/d'r]d(j) in the MAX region 
increases with increasing leading jet px in leading jet events but, on the other hand, dpT/d'qdcj) in the 
MIN region stays rather flat with leading jet pt- Please note that NLO can contribute, by definition, 
only to the transMAX region, and not to the transMIN. At least in the measurments at the Tevatron, 
the transMIN region has a level similar to that of an active minimum bias event, so the results indicate 
that higher order (beyond NLO) radiation effects are relatively small. Fig. YIM shows that contributions 
from FSR and ISR are clearly suppressed in back-to-back events compared with the leading jet events. 
The MAX and MIN dpT/drjdcj) are somewhat falling with increasing jet pt at high jet pt which could 
be due to a saturation of the multiple parton interactions. All these features are fairly well described 
by the tuned Pythia Monte Carlo {e.g., Tune A [53]). 
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Figure 23: Definitions of tlie "toward" , "away" and "transverse" regions. Tlie angle A0 = — 0jct#i 
is tfie relative azimuthal angle between charged particles and the direction of the leading jet. The 
transverse region is defined by 60° < |A0| < 120° and |?7| < 1. Figure from Ref. [6]. 
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Figure 24: The sum of charged particles inside the TransMAX (top) and TransMIN (bottom) 
regions, as a function of the leading jet pt-, in the leading jet events and back-to-back events, in data, 
Pythia Tune A and Herwig (without multiple parton interactions). Figure from Ref. [5^ . 
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4.3.3 Jet Shapes 



The jet shapes, i.e., the energy flow inside jets, in inclusive jet data were studied by CDF in Run I [13] 
and Run II [23]. The jet shape is dominated by gluon emissions from the primary outgoing parton, and 
it depends on the type of the parton originating the jet, i.e., quark or gluon. In hadron-hadron collisions, 
the jet shape is also sensitive to initial state radiation and the underlying event. Fig. [18] in Sec. 13.51 
showed the pt fraction in a cone jet of radius 0.7 that lies in the annulus from 0.3 to 0.7 as a function 
of jet pt'i data are compared with parton shower Monte Carlo predictions. Jets become narrower with 
increasing jet px due to several different factors: (1) power corrections that tend to broaden the jet fall 
as I/pt or (2) the fraction of jets originating from quarks increases with increasing jet pt and (3) 

the probability of QCD radiation decreases as ^^(pr)- The measured jet shapes are well described by 
the Pythia Monte Carlo with Tune A parameters, which were obtained based on the underlying event 
study made in the "transverse" region away from jets. It is the good description of the jet fragmentation 
and underlying event properties by the Monte Carlo events that allows the reliable evaluation of the 
unfolding correction of measurements to the hadron level and also the estimation of the hadronization 
and underlying event corrections. 

4.3.4 Summary 

In this section (Sec. 14. 3p . a few examples of the measurements that were useful for Monte Carlo tunings 
were presented. Most of analyses at the Tevatron experiments rely on parton shower Monte Carlo event 
generators, and they beneflted from these studies. At the LHC experiments, these measurements must 
be repeated since the scaling of these tunings to higher center-of-mass energies are not straightforward. 

4.4 Inclusive Jet Cross Sections 

The inclusive jet cross section has been extensively studied at the Tevatron in both Run I and Run II. 
The differential inclusive jet cross section at the Tevatron provides the highest momentum transfers 
currently attainable in accelerators. It is sensitive to a wide range of new physics, such as quark 
compositeness, and also tests perturbative QCD calculations over more than eight orders of magnitude in 
the cross section. Due to greater statistics compared to Run I and the higher centre-of-mass energy, the 
reach in transverse momentum has increased by approximately 150 GeV (see Fig. [25r a)). Measurements 
of the inclusive jet cross section have also been shown to have a large impact on global pdf analyses |55j . 
especially on the determination of gluon densities at high x, as the inclusive jet cross section has sizable 
contributions from the qg qg subprocess, even at high jet p^, as shown in Fig. [251(b). In the CTEQ6M 
global flt [Hj, the full inclusive jet data sets from the Run I measurements of CDF and D0 were included, 
resulting in the observed enhancement of the gq subprocess compared to the predictions derived from 
CTEQ5M [56] . 

The inclusive jet cross sections measured by the CDF collaboration in Run II are shown in Fig. [26l 
as a function of the jet pt in flve rapidity regions. The measurement uses the Midpoint cone algorithm 
with a cone radius of 0.7 [^ I46p[ A similar measurement was perfomed by D0 as well [57]. For 
comparisons of data to theory, the calorimeter tower energies clustered into a jet must be flrst corrected 
for the detector response and multiple pp contributions. These corrections are discussed in Sec. 14.21 
After an additional correction that accounts for the smearing effects due to the flnite energy resolution 
of the calorimeter (unfolding), the jet cross sections are corrected to the hadron- level. For data to be 
compared to a parton level calculation, either the data must be corrected from the hadron level to the 

^The measurements shown here include the search cone step covered earher in this review, as the measurements without 
the search cone are currently still underway. As discussed, the differences will be relatively small and will be confined 
mostly to lower transverse momentum. 
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Figure 25: (a) The inclusive jet cross sections at ^/s = 1.8 and 1.96 TeV from NLO pQCD predictions 
using the CTEQ6.1M pdf (figure from Ref. [M])- (b) The subprocess contributions to inclusive jet 
production at the Tevatron for the CTEQ5M and CTEQ6M pdfs (figure from Ref. [55]). 



parton level or the theory must be corrected to the hadron level. In this paper, we discuss in the latter 
scheme; the former just involves the inverse corrections. 

These corrections are intended to account for two effects: (1) "underlying event", i.e., the energy not 
associated with the hard scattering, and (2) "hadronization" , i.e., a loss of energy outside a jet due to 
the hadronization process. In recent analyses on inclusive jet cross sections by CDF [l3l HH HH |16] , the 
hadronization corrections are evaluated by comparing the results obtained from Pythia at the hadron 
level to the results from Pythia when the underlying event and the parton fragmentation into hadrons 
has been turned off. The underlying event energy is due to the interactions of the spectator partons in 
the colliding hadrons and the size of the correction depends on the size of the jet cone. It is ~ 1.5—2 GeV 
for a cone of radius 0.7 and is similar to the amount of energy observed in an arbitrarily placed cone of 
this size in minimum bias events with a high track multiplicity. 

The hadronization correction accounts for hadrons outside the jet cone originating from partons 
whose trajectories lie inside the jet cone; it does not correct for the effects of hard gluon emission 
outside the jet cone, which are already accounted for in the NLO predictior|§. The numerical value of 
the hadronization energy is roughly constant at 1 GeV for a cone of radius 0.7, independent of the jet 
transverse momentum. This constancy may seem surprising. However, as the jet transverse momentum 
increases, the jet becomes more collimated; the result is that the energy in the outermost annulus (the 
origin of the hadronization energy) is roughly constant. The evaluation of these two effects are reliable 
only if the Monte Carlo events provide a reasonable description of the underlying event and of the jet 
fragmentation and hadronization properties. Studies on underlying event and jet fragmentation are 
presented in Sec. 14.31 

The two effects (underlying event and hadronization) go in opposite directions, so there is a partial 
cancellation in the correction to parton level. For a jet cone of 0.7, the underlying event correction is 
larger, as seen in Fig. [271 for the case of inclusive jet production at CDF. For a jet cone radius of 0.4, 
the hadronization correction remains roughly the same size but the underlying event corrections scales 

^Such corrections for hard gluon emission are often made, however, if the comparison is to a leading order parton 
calculation, such as for the reconstruction of a tt final state. 
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Figure 26: The inclusive jet cross sections measured with the Midpoint algorithm by CDF in Run 
II 



by the ratio of the cone areas; as a result the two effects basically cancel each other out over the full 
transverse momentum range at the Tevatron. Note also, as illustrated in Fig. [271 that the magnitudes 
of the fractional corrections are relatively insensitive to the value of pt for jet momenta larger than 200 
GeV. Although the fractional changes in the jet pt due to the underlying event and to hadronization 
decrease roughly as 1/pt, the jet cross section itself is becoming much steeper and hence more sensitive 
to changes in the pt- These two behaviors essentially cancel each other and lead to the observed nearly 
constant correction factors. 

A comparison of the inclusive jet cross section measured by CDF in Run II with the Midpoint cone 
algorithm ^31 H6] to NLO pQCD predictions using the EKS [10] program with the CTEQ6.1 |55] and 



MRST2004 [58] pdfs is shown in Fig. [281 A renormalization/factorization scale of (p^^*/2) has been 
used in the calculation. This is the scale at which the Run I jet data were included in the global 
fits [21 [S5], so the same scale should be used for self-consistency. Typically, this choice of the renormal- 
ization/factorization scale leads to the highest predictions for inclusive jet cross sections at the Tevatron. 
There is good agreement with the CTEQ6.1 predictions over the transverse momentum range of the 
prediction. Note that CTEQ6.1M gluon density is already enhanced at large x, as compared to previous 
pdfs, due to the influence of the Run I jet data from CDF and D0. This enhanced gluon provides a good 
agreement with the high pt Run II measurement as well which is extended approximately by 150 GeV 
in jet Pt- The MRST2004 pdfs also contain an enhanced higher x gluon, which leads to reasonable 
agreement with the measurements. The red curves indicate the pdf uncertainty for the prediction using 
the CTEQ6.1 pdf error set. The yellow band indicates the experimental systematic uncertainty, which 
is dominated by the uncertainty in the jet energy scale (on the order of 3% as shown in Fig. [201) . The 
purple band shows the effect of the uncertainty due to the hadronization and underlying event, which 
is visible only for transverse momenta below 100 GeV. In Fig. [291 the jet cross sections measured with 
the Midpoint cone algorithm are shown for the five rapidity regions of the CDF experiment. Good 
agreement is observed in all rapidity regions with the CTEQ6.1 predictions. It is also important to note 
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Figure 27: Magnitude of the fragmentation and underlying event corrections used to correct the 
inclusive jet cross sections measured by CDF as a function of jet pt, for a cone size R = 0.7. 
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Figure 28: The inclusive jet cross section from CDF in Run II compared on a linear scale to NLO 
theoretical predictions using CTEQ6.1 and MRST2004 pdfs. 
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that for much of the kinematic range, the experimental systematic errors are less than pdf uncertainties; 
thus, the use of this data in future global pdf fits should serve to further constrain the gluon pdf. 
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Figure 29: The inclusive jet cross section from CDF in Run II, for several rapidity intervals using the 
Midpoint cone algorithm, compared on a linear scale to NLO pQCD predictions using CTEQ6.1 pdfs. 

While this measurement has been carried out, a new seedless cone algorithm, SISCone [16] (see also 
Section [3.1.3p . has become available. The differences in the inclusive jet cross sections between the 
Midpoint algorithm^ and SISCone algorithm are evaluated in the Pythia Monte Carlo samples both at 
the hadron level and parton level and are shown in Figs. [30] and [3TJ The hadron level inclusive jet cross 
sections are obtained from the Pythia Monte Carlo samples generated with the Tune A parameters, 
and the parton level inclusive jet cross sections are obtained from the Pythia Monte Carlo samples 
generated with the underlying event turned off. The cross sections at these levels are used to determine 
the hadronization corrections. Fig. [30] indicates that the hadron level inclusive jet cross sections are 
different between the SISCone and Midpoint algorithms by 5% at low jet pt with the differences 
decreasing with increasing jet pt] in the meantime, the differences are less than 1% at the parton 
level for any jet px (see Fig. [HT]) . This indicates that the hadronization correction is different between 
the SISCone and Midpoint algorithms by up to 5%; however, the impact on comparisons between the 
measurement and NLO predictions is negligible (less than 1%). 

CDF has also made measurements of the inclusive jet cross sections with the kx algorithms in Run 
II. In Fig. [321 the experimental jet cross sections using the kx algorithm from CDF Run II [15] are 
compared to NLO pQCD predictions using the Jetrad [M] program. Similar to the measurement using 
the Midpoint cone algorithm, good agreement is also observed between data and theory. A comparison 
of these measurements in the central region is presented in Fig. [33] [IS]. In order to require that the 
algorithms use approximately the same size in y — </> space, the cone size for the Midpoint algorithm and 

^For comparisons with the SISCone algorithm, the Midpoint algorithm without the search cone step is used in order to 
investigate the effects of seeds and slight differences in the merging procedure of the overlapping stable cones only. Please 
also note that D0 uses the Midpoint algorithm without the search cone step, and a measurement using the Midpoint 
algorithm without the search cone step is being finalized at CDF too. 
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Hadron Level: Midpoint versus SISCone 




Figure 30: The ratio of the hadron level inclusive jet cross section with the SISCone algorithm to that 
with the Midpoint algorithm in five rapidity regions in Pythia Monte Carlo samples generated with the 
Tune A parameters. 
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Figure 31: The ratio of the parton level inclusive jet cross section with the SISCone algorithm to 
that with the Midpoint algorithm in five rapidity regions in Pythia Monte Carlo samples in which the 
underlying event has been turned off. 
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the D parameter for the kx algorithm will both be taken as 0.7 {Rcone = D = 0.7). It is important to 
note that we expect different predictions on the cross sections for jets clustered with the midpoint and 
kx algorithms when the parameters Rcone and D are set equal. Fig. shows the ratio of the measured 
inclusive jet cross sections for jets clustered with the kx algorithm to the result for jets clustered with 
the Midpoint algorithm. Only statistical errors are shown assuming no correlation between the two 
measurements. The prediction of this ratio from NLO pQCD calculations using fastNLO [59] is also 
shown, and good agreement is observed. As expected, for D = Rcone, the kx cross section, at the parton 
level, is slightly smaller than the cross section using the cone algorithm!^. 

Fig. [34] shows the ratio of the hadron to parton level correction derived with the kx algorithm 
to the one derived with the Midpoint algorithm. These corrections were derived from Pythia Tune 
A [S3] (see Sec. 14.31) . The multiplicative corrections are both smaller than one, so the observed ratio 
indicates that the size of the correction obtained with the kx algorithm is larger (i.e., farther away from 
unity) than the correction obtained with the midpoint algorithm. The consistency of the data-theory 
comparisons between the kx inclusive jet cross section measurement and the Midpoint measurement 
indicate the robustness of the obtained results and adds credence to the fact that the jet definitions are 
made consistently at the parton and detector levels. 



4.5 VK/Z+Jets 

The production oi a.W or Z boson in conjunction with jets is an interesting process in its own right as 
well as a background to many Standard Model (SM) and non-SM physics signals. Jet multiplicities of 
up to 7 have been measured at the Tevatron. Production of W/Z+ jets at the Tevatron is dominated by 
gq initial states. The NLO cross sections have been calculated only for W/Z+up to 2 jets; predictions 
for the higher jet multiplicity final states are accessible through matrix element (+ parton shower) 
predictions and in fact can be considered as a prime testing ground for the accuracy of such predictions 
as well as for measurements of a^. The jet multiplicity distribution for W + n jets measured at the 
Tevatron is shown in Fig. [33 The measurements (and predictions) were performed with a jet cone of 
radius 0.4 and a minimum transverse energy requirement of 15 GeV. A smaller jet cone size is preferred 
for final states that may be "complicated" by the presence of a large number of jets. Also shown in the 
figure are the NLO predictions (for jet multiplicities less than 3), parton shower Monte Carlo predictions 
and LO matrix element + parton shower Monte Carlo predictions. The NLO predictions are able to 
describe the absolute rates for jet production (for up to 2 jets) while the LO matrix element + parton 
shower Monte Carlo predictions can describe the relative jet multiplicity rates. 

A recent measurement of W ^ 61^+ > n jets from CDF is shown in Fig. [361 In this analysis, 
the data have been reconstructed using the JETCLU cone algorithm with a cone radius of 0.4. The 
data have been compared, at the hadron level, to predictions using matrix element information from 
Alpgen [ini [SO], and parton shower and hadronization information from Pythia [1]. The agreement is 
reasonable, although the data has a tendency to be somewhat softer than the predictions. This data has 
been corrected to the hadron level; this makes it convenient for comparison to any hadron level Monte 
Carlo predictioij^. Data corrections to the hadron level (and/or parton level) should be the norm for 
measurements at the Tevatron and LHC, in order for the best interplay between theory and experiment. 



^For the NLO pQCD prediction for jets clustered with the Midpoint algorithm, the Rsep parameter of 1.3 is used. If 
the Rsep parameter is set to 2, i.e., Rsep is not used, the result is a larger difference between the two algorithms at NLO 
(i.e., setting Rsep — 2 results in a larger jet cross section as seen in Fig. |8|). 

^As mentioned before, the corrections for underlying event and for fragmentation basically cancel each other out for a 
cone of radius 0.4, for inclusive jet production, so that the hadron level predictions are essentially parton level predictions 
as well. It was found that the same statement is essentially true also for W-\- jet production. 
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Figure 32: The inclusive jet cross section from CDF in Run II, for several rapidity intervals using 
the kx jet algorithm, compared on a linear scale to NLO theoretical predictions using CTEQ6.1 pdfs. 
Figure is from Ref. [15]. 
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Figure 33: Ratio of the inclusive jet cross section measured with the algorithm to that measured by 
the Midpoint algorithm (black points). The prediction of this ratio from the NLO program fastNLO is 
also shown in the figure (blue line). 
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Figure 34: Ratio of the hadron to parton level correction derived with the algorithm to that derived 
with the Midpoint algorithm. The multiplicative corrections are both smaller than one. The correction 
derived with the fcy algorithm is larger (farther away from unity). The corrections were derived from 
Pythia Tune A. 
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Figure 35: The rate of production of W + n jets at CDF. The measurements and predictions were 
performed with a cone jet of radius 0.4 and with a requirement of 15 GeV/ c or greater. The MCFM [60] 
predictions are absolutely normalized. The CKKW [61] predictions are normalized to the first bin. A 
scale of 10 GeV/c has been used for the matrix element /parton shower matching. 
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Figure 36: A comparison of the measured cross sections for W+ > n jets in CDF Run II to predictions 
from Alpgen+Pythia. The experimental cross sections have been corrected to the hadron level. 

4.6 Heavy Flavor Jets 

Many of the interesting final states at the Tevatron, such as ti decays and H bb, involve fe-quark jets 
(6-jets). W/Z + b and 7 + & processes are also interesting; they are major backgrounds in Standard 
Model Higgs or SUSY searches, and they are also sensitive to the b content of protons. There are two 
main challenges in the analyses that deal with 6-jets; (1) the 6-jet identification (6-tagging) and (2) the 
energy measurement of the 6-jet. 

There are characteristics of 6-jets that differentiate them from light flavor and charm jets: 

• the long lifetime of the b quark 

• the large mass of B hadrons 

• the energetic semileptonic decay of B hadrons 

The algorithms that identify 6-jets exploit these characteristics to separate 6-jets from the other 
jets. The most-widely used algorithm to tag heavy flavor jets at CDF is the secondary vertex algo- 
rithm, often referred to as SecVtx p2l [S3]- Because of their long lifetime, b quarks typically decay a 
measurable distance from the primary interaction point, and so the algorithm reconstructs the decay 
vertices (secondary vertices) using a minimum of two or three tracks with an impact parameter sig- 
nificance {do/a{do)) greater than 3.0 or 2.0, respectively. The impact parameter (do) is the minimum 
distance between the track and the primary vertex in the plane transverse to the beam direction and 
a{do) is its uncertainty. The two-dimensional displacement of the secondary vertex from the primary 
interaction point projected along the jet axis {L2d) is then measured; a jet is b tagged if the vertex 
has L2D significance larger than 7.5, where the uncertainty on L2D includes contributions from both 
the primary and secondary vertex fits [63]. When considering a 6-tagging algorithm, it is important to 
understand how often one tags a 6-jet in the data {i.e., tagging efficiency) and how many of the tagged 
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Figure 37: (a) 6-tagging efficiency for fe-jets in top decays and (b) mis-tagging rate as a function of jet 
Et at CDF (figures from Ref. [63]). 



jets are actually from non-6's (i.e., mis-tagging rate). These features for SecVtx at two operating points 
are shown in Fig. [371 

There are other 6-tagging algorithms developed at CDF. The algorithm called JetProbability con- 
siders df) of each track within a jet and constructs a probability that a given jet is consistent with 
coming from a zero-lifetime source [61]. The soft lepton tagging algorithm identifies 6-jets by looking 
for a semileptonic B hadron decay withing a jet [65]. Efforts to combine these tagging tools using a 
multivariate technique like a neural network are also underway. 

Since 6-jets have quite different characteristics from other jets (light quark or gluon), special care is 
necessary for the energy measurement. The energy correction from calorimeter jets to hadron-level jets 
or to the parent parton is different for 6-jets than for generic jets because of different parton shower 
and fragmentation properties, and also due to the presence of semi-leptonic decays. Both CDF and 
D0 generally rely on MC simulation to model the 6-jet energy scale. In the CDF top quark mass 
measurements [H], additional uncertainties are evaluated for the 6-jet energy scale: (1) uncertainties in 
energy response arising from uncertainty in the B meson semi-leptonic branching ratios, (2) uncertainties 
arising from the imperfect knowledge of the fragmentation properties of 6-quarks, and (3) uncertainties 
arising from the different color flow associated with 6-jets produced in top quark decay. The fe-jet 
energy scale uncertainties from these sources are evaluated by changing the relevant parameters in the 
MC based on the constraints from other experiments, and they yield an additional 0.6% uncertainty in 
total. 

Possible ways to test the 6-jet energy scale in pp data would be to use the photon-6-jet balance or 
the Z ^ bb resonance. D0 recently made a preliminary measurement on 6-jet energy scale in 7+6-jet 
events using the missing Et projection fraction (MPF) method [66]. The MPF method is described in 
detail in Section I4.2.2[ The study suggested that b-jets need additional energy corrections of as much 
as 10% at energies around 20 GeV and about 5% at energies of 150 GeV. 

CDF and D0 have also extracted the Z — > 66 signal and are seeking to use it to test and calibrate 
the 6-jet energy scale [671 [68]. At CDF, a dedicated trigger was implemented to collect a large sample 
of Z decays to 6-quark pairs by requiring two tracks with displaced vertices and two jets. Dijet events 
with both jets being tagged as 6-jets were selected offiine. The signal was extracted by fitting the data 
with the QCD background shape computed using untagged data passing the same kinematic selection 
together with the Z ^ bb signal and the Z —>■ bb signal shape computed with Pythia. The extracted 
Z ^ bb signals by CDF and D0 are shown in Fig. [381 The measured data/MC scale factor for 6-jet 
energy scale for CDF is A; = 0.974 -|- 0.020 — 0.018(stat © syst). The achieved uncertainty of ~ 2% is 
smaller than the convolution of the generic jet energy scale and additional 6-jet specific uncertainties of 
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~ 4%, and thus the Z bb signal can provide a good constraint on the fc-jet energy scale. The signal is 
also expected to serve as a tool to test 6-tagging algorithms and also for improvements in the jet energy 
measurement algorithms which will be discussed in Sec. 14. 7[ 




Figure 38: (a) the bb dijet mass distribution measured in CDF data (points) and the estimated QCD 
bb background (green) and Z ^ bb signal shape (red). The inset shows data minus background dis- 
tributions compared to the Z ^ bb signal shape, (b) Z ^ bb peak extracted from D0 data (points), 
compared to the shape of Z ^ bb distribution in MC (histogram). Figures from Refs. [671 168] . 



4.7 Particle Flow Type Approaches 

In the Tevatron experiments, by default, jets are reconstructed by running a jet clustering algorithm 
on energy depositions in the calorimeters, i.e. basically only calorimeter information is used in the 
standard jet reconstruction and energy measurement. The HI collaboration at HERA [69] and OPAL 
collaboration at LEP [70], among others, have successfully used an algorithm that incorporates tracks 
in jet reconstruction, and such an algorithm has been tested at CDF and D0 as well. As discussed, 
the energy measurement of hadronic particles by calorimeters suffers from large fluctuations and non- 
linearities; tracking information can be used to reduce such effects and to improve the energy resolution 
in jet energy measurements. There is also an attempt in CDF to further improve the jet energy 
measurmenet by using the information from the shower-max detector (a wire chamber placed near 
the shower maximum position in the central electromagnetic calorimeter) to sort out the overlapping 
particles like with vr^ [71]. Although such methods are not yet mature enough to be used in any 
physics analysis, they could be helpful in achieving our goal of percent precision and should be pursued 
further. 



5 Jets at the LHC 

The experience gained at the Tevatron is extremely useful in the preparation for physics analysis with 
jets at the LHC. However, hard scattering at the LHC is not just "re-scaled" scattering from the 
Tevatron. Many of the interesting physics signatures will take place with relatively low x partons and 
thus there will be a dominance of gluon and sea quark interactions, as compared to interactions involving 
valence quarks. In addition, as the initial state partons are at low x, there is enormous phase space for 
gluon emission, and so a large probability for additional jets from initial state radiation. The underlying 
event is also expected to be enhanced compared to the activity observed at the Tevatron, largely through 
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Figure 39: NLO Inclusive jet cross section predictions for the LHC using the EKS [TO] program and the 
CTEQ6.1 central pdf and the 40 error pdf's(left); the ratios of the jet cross section predictions for the 
LHC using the CTEQ6.1 error pdf's to the prediction using the central pdf (right). 

an increased rate of semi-hard multiple-parton interactions. These interactions will contribute to the 
energy measured in jets produced in the hard scatter, and may often lead to the production of extra 
low transverse momentum jets in their own right. Thus, the LHC will be a very "jetty" environment 
and accurate measurements of the dynamics of the hard scattering may be challenging [T] . There is 
then a need for tools even more powerful than the ones used at the Tevatron to reconstruct jets. The 
most interesting tools focus on the reconstruction of jet shapes, thus exploiting the significantly finer 
readout granularity of the LHC detectors. In this section we summarize some of the related aspects 
presently under study, and show some expectations for the jet final state at LHC and its representation 
in the detectors. 

5.1 Expectations for Jet Final States 

As already mentioned, jets will be generated in basically all final states expected in pp collisions at 
^/s = 14 TeV at the LHC. For most channels they are going to be the dominant part of the detectable 
signal, thus providing major input to the reconstruction of the event kinematics. Precision requirements 
on the jet energy scale are high compared with the Tevatron, with systematic uncertainties of less than 
1% absolute needed/expected for jets reconstructed in ti production, or for jets generated at the end of 
long decay chains in certain SUSY models. 

The kinematic reach of jets produced in QCD 2^2 processes at the LHC greatly extends that 
possible at the Tevatron; for example, compare Fig. [25] in section U3] with the predictions for the LHC 
shown in Fig. [39l Even in the initial lower luminosity running phase, the jet production rate is also 
very high, as is summarized in Tabled! It is interesting to note that the pdf uncertainty for jets at the 
LHC at the highest attainable transverse momentum is similar to that for the Tevatron at its highest 
attainable transverse momentum. 

The partonic phase space available in pp collisions at the LHC allows for a large amount of extra 
radiation, as can be observed in the number of jets calculated as a function of the leading jet px in 
QCD 2 — > 2 processes (see Fig. HOj). For low pt the number of additional jets is suppressed by the pr 
cut applied to jets being non-negligible compared to the transverse momentum of the hard scattering, 
while the drop towards higher px indicates that radiation is suppressed due to the increasing dominance 
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Table 1: Expected inclusive jet production rate at LHC in one run year with an integrated luminosity 
/ Cdt = 10 fb-^ 
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of qq in higher partonic x scatterings, and the subsequent lower color factors in the collision. There is 
also an extra suppression due to the higher x values of the incident partons. 

5.2 Jet Physics Environment at the LHC 

Jet measurements are affected by the presence of multiple semi-hard interactions from other parton- 
parton pairs from the proton-proton collision of interest, i.e. the underlying event (UE) of the same 
collision. In addition, the physics environment at the LHC is also affected by additional minimum 
bias (MB) collisions of other proton pairs in the same bunch crossing. Both effects limit the efficiency 
for reconstructing the hadron-level jets (and ultimately the parton-level jets) from the hard scattering 
(signal) and also add complex features to the already non-trivial detector jet signals. They are also 
major sources for uncertainties in the present simulation-based performance estimates for LHC physics. 

5.2.1 The Underlying Event at the LHC 

There is a great deal of uncertainty in the level of underlying event activity expected for pp collisions 
at ^/s = 14 TeV, as can be observed in Fig. HH This uncertainty is a major factor in estimating the 
quality of the reconstruction of the jet signals. Estimates derived from a recent tuning of Pythia |1] 
("Pythia 6.214 CDF Tune A"), discussed previously in this paper, actually predict significantly fewer 
tracks from the UE at the LHC than the parameters previously used in the same Pythia version by 
ATLAS [73| ("Pythia 6.214 ATLAS" in Fig. SI]). The determination of the level of this underlying event 
activity will be one of the first measurements to take place upon startup of the LHC. 

An ATLAS study similar to the one carried out by CDF (and shown in Fig. [23]) indicates that the 
sum of the transverse momentum of the charged particles in the transverse region (see Fig. [23]) will vary 
from approximately 10 GeV/c for low jet transverse momentum to over 30 GeV/c for jet transverse 
momenta larger than 1 TeV/c [73] . 

5.2.2 Minimum Bias Events and Pile-up 

Contrary to the underlying event energy, the additional minimum bias interactions present at higher 
luminosities provide a source for px flow in the LHC collisions which is not correlated at all with the 
(triggered) hard scattering of the signal event. Its total contribution is dependent on the instantaneous 
luminosity. Assuming a total pp cross-section of about 75 mb and excluding single diffractive and double 
diffractive collisions, one can expect an average number of MB collisions of ~ 23 (Poisson-distributed) at 
the design luminosity [L = 10'^^cm~^s~^), ~ 4.6 at the "initial" luminosity L = 2.0 x lO^^cm'^s"^, and 
~ 0.02 at the LHC startup0 {L = 10'^^cm~^s~^, the most recent expectation). This activity is therefore 

^°These estimates assume that the bunch crossing time is 25 ns, with about 3000 bunches in LHC. Less frequent bunch 
crossings, and a smaUer number of (longer) bunches at the same stored current, as recently discussed for initial LHC 
running, can increase the pile-up significantly, even at lower luminosities. 



49 



10.0 
8.0 
6.0 
4.0 
2.0 
0.0 



Cone Rcone = 0.7 



■-0-- 



-O- "O- I 



-o- 



• p™" =7 GeV/c - 
o Pt™" = 20 GeV/c 



Cone Rrnne. = 0.4 



■-0-- 



-o- 



■-0- 



oo 



--o-o 



o 



o 



pmin ^ 7 GeV/c 



o p;- 



20 GeV/c 



> 

03 



a; 

-9 8.0 



fcf D = 0.6 



h D = 0.4 



• •- • • 



6.0 
4.0 
2.0 
0.0 



-o-o 



„ o 



o 



-o- -o- 



-o- 



o-o- 



-o- 



-o- 



pmm ^7 GeV/c 



-o- 



o pr 



20 GeV/G- 



o 





-o- - p™'" = 7 GeV/c - 

o p;""' = 20 GeV/c- 




1.0 1.5 2.0 2.5 3.0 3.5 1.0 1.5 2.0 2.5 3.0 3.5 

lOQio (Pi/GeV) (leading jet) 



Figure 40: Number of hadron jets in simulated QCD 2^2 LHC events, as a function of the leading 
jet pt, for various jet finders. The shapes of the curves can be understood as a convolution of the pt 
cut applied in jet finding, the increasing probability for extra hard gluon emission with the increasing 
hardness of the 2 — »• 2 scattering, the change from gluon to quark jets with increasing pt, and finally 
the restricted phase space for extra jets accompanying very energetic primary jets. 
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Figure 41: Number of charged tracks in the transverse region of the QCD 2^2 interaction plane (as 
defined in Fig. [23]) as a function of the leading jet pt- Data from CDF [2] are shown together with model 
predictions for the LHC (figure from [73]). 
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Figure 42: Simulated forward going quark jets from Higgs production via WW scattering, without (left) 
and with pile-up (right) in the ATLAS forward calorimeter. The signals with pt < are due to the 
specific choice of bi-polar signal shaping functions for this detector. The signals in the central detector 
regions are omitted here for clarity. 

likely negligible at the LHC start-up, but can produce a large number of non-signal tracks/energy at 
the design luminosity. 

The actual detailed effects of the minimum bias events on the jet signal depend strongly on the 
calorimeter technology and readout electronics. For example, the rather slow signal formation in the 
ATLAS liquid argon calorimeters with typical charge collection times of about 500 ns (which has to 
be compared to the LHC bunch crossing time (25 ns)) and the rather large pp cross-section leads to 
a history of previous collision signals still visible in the actual event. The effect averages to energy, 
due to the specifically chosen bi-polar (canceling area) signal shaping, but the "out-of-time" pile-up 
adds to the fluctuations. Fig. shows the simulated response to two rather low pj- jets from vector 
boson fusion (VBF) Higgs production in the ATLAS forward calorimeters with and without full LHC 
luminosity pile-up added. Both jets are well visible above the noise, but their shapes, as well as 
their signal amplitudes, are changed, which makes calibration in this particularly hostile region very 
challenging. A more quantitative estimate for the signal fluctuations introduced by pile-up in jet cones 
in shown in Fig. US] [Zl] • 



5.3 Experimental Aspects of Jet Reconstruction at the LHC 

The large calorimeter systems in ATLAS and CMS at the LHC are the basic detectors for jet recon- 
struction. Both systems provide hermetic coverage up to pseudorapidities of ~ 5. Cell sizes and readout 
granularity vary widely within each of the systems and introduce different limitations on the calorimeter 
signals used for jet finding and reconstruction. In general, though, the cell sizes are smaller than in the 
Tevatron calorimeters, allowing for the development of more powerful jet clustering software. Typical 
depths of the calorimeter systems exceed 8 — 10 absorption lengths. Fig. |44] gives an overview on these 
detectors, with some of the relevant details described below. 
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Figure 43: Estimated signal fluctuations from full pile- up simulations, calculated as RMS in px for the 
central and endcap region in ATLAS, as a function of the square root of the jet cone area. For a typical 
cone size Rcone = 0.7 the fluctuation are about 12 GeV in both detector regions (taken from [7i]). 
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Figure 44: The ATLAS calorimeter system (left) and CMS detector system (right). 
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5.3.1 Brief Look at the ATLAS and CMS Calorimeters 



CMS features a highly granular central electromagnetic lead-tungstate (PbW04) crystal calorimeter 
with very small lateral cell sizes [Ar] x A0 = 0.0174 x 0.0174 within |?7| < 1.479). The calorimeter 
consists of 61200 individual crystals and has a total depth of 25.6 radiation lengths (Xo). The crystals 
point back to the vertex, within a very small tilt. The electromagnetic endcap calorimeters also use 
PbW04 crystals, but with a coarser granularity and a rectangular pattern of fixed-sized crystals. The 
depth of these crystals is about 24.7 Xq. Each endcap covers the region 1.479 < \ri\ < 3.0. 

Hadronic calorimetry in CMS features tiled scintillator read-out with brass absorbers. The central 
hadronic calorimeter is arranged outside the central electromagnetic calorimeter, but still inside the 
solenoid magnet. It has a granularity of Arj x A(f) = 0.087 x 0.087 in one depth segment within 
|?7| < 1.4. The hadronic outer detector, a scintillator layer attached to the outside of the solenoid 
magnet, provides additional depth coverage for hadrons. The hadronic endcap calorimeters feature 
the same readout technology with decreasing granularity in r], starting with At] x A0 = 0.087 x 5° at 
|?7| = 1.3 to Ar] X A(f) = 0.35 x 10° at |?7| = 3.0. The gap to the beam pipe is closed by a steel/quartz- 
fiber forward calorimeter with typical granularity of At] x A0 ^ 0.175 x 10°. All CMS calorimeters 
are non-compensating and require specific calibrations for hadrons and jets. More details on the CMS 
detector can be found in [75] . 

The ATLAS calorimeter system features a central highly granular electromagnetic liquid argon/lead 
accordion calorimeters with a pointing readout geometry. Cell sizes vary from At] x A0 = 0.003 x 0.1 
in the first depth segment to At] x A(f) = 0.025 x 0.025 in the second, and Arj x A0 = 0.5 x 0.025 in the 
third depth sampling. Depending on rj, the depth varies from 26 to 36 Xq. The central electromagnetic 
calorimeter is, contrary to the setup in CMS, located outside the solenoid magnet. The calorimeter 
covers a region of |?7| < 1.475. 

The electromagnetic endcap calorimeter in ATLAS features a liquid argon/lead accordion-type 
calorimeter with the absorber folded like a Spanish fan. It covers 1.375 < |?7| < 3.2, with three 
depth samplings up to |?7| = 2.5, and two in 2.5 < |?7| < 3.2. The lateral size of the pointing cells is 
Ar] X A0 = (0.003 — 0.006) x 0.1 in the first sampling, Arj x A0 = 0.025 x 0.025 in the second, and 
Ar] X A0 = 0.05 x 0.025 in the third sampling, all up to \r]\ = 2.5. Beyond that, the two remaining 
samplings of the electromagnetic endcap calorimeter have Ar] x A0 = 0.1 x 0.1. 

Hadronic calorimetry in ATLAS is provided by the steel/scintillator tile calorimeter in |?7| < 1.7, 
which has three samplings with quasi-projective cells of Ar] x A0 = 0.1 x 0.1 in the first two, and 
Ar]xA(j) = 0.2x0.1 in the last depth sampling. The endcap hadronic calorimeter is a liquid argon/copper 
parallel plate calorimeter with four depth samplings and quasi-projective cells with Ar] x A(j) = 0.1 x 0.1 
in 1.5 < \r]\ < 2.5, and Ar/ x A0 = 0.2 x 0.2 in 2, 5 < \r]\ < 3.2. The net resuh is that the ATLAS 
calorimeters have six or seven depth samplings for hadrons, depending on the particle direction. 

The ATLAS forward calorimeter covers 3.1 < |?7| < 4.9 and consists of three modules. The first 
(electromagnetic) module is a liquid argon/copper calorimeter featuring tubular thin gap electrodes. 
The two hadronic modules have a tungsten absorber with a similar electrode geometry. The readout of 
the forward calorimeter is organized in non-projective rectangular cells, with an approximate cell size 
of Ar] X A0 = 0.2 x 0.2. The total number of calorimeter channels in ATLAS is close to 200, 000. And 
like in CMS, all ATLAS calorimeters are non-compensating as well. For more details, see [72] . 

5.3.2 Calorimeter Jet Basics 

CMS uses projective calorimeter towers on a grid of Ar] x A0 = 0.1 x 0.1 as input to jet finding. Only 
towers with > 1 GeV/c are considered. The towers correspond to massless 4- vectors by definition, 
i.e., their kinematic contribution is fully specified by their transverse momentum px, their rapidity y, 
and azimuth [76]. In ATLAS, two different calorimeter signal definitions are used as input for jet 
reconstruction. As in CMS, projective cell towers with Ar] x A0 = 0.1 x 0.1 are used, but without 
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Figure 45: Jet reconstruction flow in ATLAS. The left diagram shows the sequence for the tower based 
reconstruction, while the right diagram shows the cluster based reconstruction sequence. 
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any restriction on the actual value of px- This means it is possible to have towers with a non-physical 
four-momentum like E < ^ Pt < 0, with rj, (j), and m = fixed by location and/or definition. The 
negative tower signal is generated by electronic noise as well as by signal fluctuations from the MB 
(pile-up) events. 

On the other hand, the negative energy towers cannot be accepted by the jet finders, as the four- 
momentum recombination of protojets (the towers) requires legitimate, physically allowed kinematic 
variables. This introduces the need for noise compensation or suppression. In ATLAS, a noise com- 
pensation based on the pre-summation of towers into protojets has been introduced, which is based on 
the idea that negative signal towers are merged with neighboring positive signal towers until the total 
energy in the corresponding protojet is above 0. Negative towers without any close-by positive signal 
towers are dropped. 

ATLAS also uses three-dimensional topological calorimeter cell clusters as input to jet finding. The 
basic idea of this signal definition is the attempt to reconstruct calibrated energy blobs in the calorimeter 
using the energy flow and shower development correlation between neighboring cells. The result makes 
optimal use of the fine granularity in ATLAS, especially of the longitudinal segmentation. The clustering 
algorithm is based on cell signal significance, as measured by the signal-to-noise ratio S/a, where the 
noise a can include pile-up fluctuations. Different thresholds are requested for cells seeding a cluster 
(primary seeds, \S/a\ > 4), cells defining the growth of a cluster (secondary seeds, \S/a\ > 2), and 
cells to be included because they are direct neighbors of any one of these two seeds {S/a > 0). Note 
that large negative fluctuations can seed a cluster as well. This has been introduced to have an average 
cancellation of positive noise contributions. Naturally clusters with negative total signal are not used 
in jet reconstruction, but negative signal cells within a cluster with positive total signal contribute to 
the jet signal. The algorithm initially puts all topologically connected cells into one cluster. In a second 
pass, the clusters are analyzed with respect to local signal maxima. Clusters with more than one local 
signal maximum are split, and the energy in the cells between the two signal peaks is typically shared 
by the resulting two clusters [77] . 

Most clusters have measurable shapes with respect to location, longitudinal and lateral extension, 
energy sharing in calorimeter compartments, etc. Variables describing these shapes are used to fully 
calibrate the clusters to a local hadronic energy scale. This procedure includes the attempt to first 
classify each cluster with respect to the character of its generating particle(s), followed by the appli- 
cation of hadronic weighting functions for "hadronic-looking" clusters. These functions are typically 
parametrized using shape and location variables {e.g., the depth of cluster center in the calorimeter) 
sensitive to hadronic shower development. Next, corrections for dead material losses are applied to all 
kinds of clusters found near cracks and dead material in the calorimeter. The final cluster correction 
then attempts to recover signal losses outside of clusters, as introduced by the noise cuts used in the 
cluster formation algorithm discussed above. Note that none of these procedures use a jet context at 
all. All calibrations and corrections here are derived from single pion and electron signals alone, and 
can thus be bench-marked with experimental test-beam data. 

Fig. |45] shows a schematic overview on calorimeter jet reconstruction from towers and clusters in 
ATLAS, respectively. Both calorimeter signal definitions have their specific advantages and disadvan- 
tages when used in jet reconstruction. Any particular choice of the calorimeter signal has serious effects 
on jet reconstruction and calibration, though, as discussed more in the following sections. 

5.3.3 Calorimeter Jet Calibration 

Several jet calibration models are under investigation in ATLAS. The most commonly used model is 
based on a modified cell signal weighting technique, following the original suggestions from HI [69]. It 
can be applied to jets from towers as well jets from uncalibrated topological clusters. 

In a first step, the cell content of the tower or cluster jet is retrieved. Then, a cell signal weighting 
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function Wc is applied for each cell, depending on its locatioiJl^ Xceii and the cell signal density pceii = 
Eceii/Vceih where Eceii is the cell signal on an initial (electromagnetic) energy scale, and Vceii is the 
physical volume of the cell. Finally, the jet kinematics is re-calculated using the now calibrated cell 
signals: 

^je.t = WciXcell, Pcell)Pcelh 

cellsGjet 

with Pceii = {E cell iP cell) a^d \E(.eii\ = \'Pceii\- Notc that in this scenario the direction of the original jet 
can change. 

The weights can be determined using the particle "truth" jet in simulations by adjusting them such 
that the reconstructed jet energy on average is identical to the matched particle jet energy. In this 
scenario the weights reflect all corrections needed to reconstruct the particle jet. In particular, energy 
lost in particles which do not reach the calorimeter due to the magnetic field is compensated. In the 
same sense, upstream energy losses in dead material are corrected by this normalization choice. 

Using the calibrated topological clusters as input for jet finding makes the cell signal-based calibra- 
tion in the jet context, as discussed above, obsolete. In this approach the cluster signals are already 
fully calibrated in a less biased context, as for example the jet algorithm choice does not enter into 
the hadronic calibration at this level (see discussion at the end of section [5.3.2p . To first order, the 
reconstructed jet kinematic is then given by the sum of the calibrated cluster four-momenta. Note 
that the cluster calibration does not account for energy losses due to the magnetic field and some dead 
material, as obviously only energy losses with some correlation to the shape or magnitude of a nearby 
cluster signal can be corrected. Especially energy lost away from any cluster, and energy losses due to 
the loss of small signals by the implicit noise suppression applied in the cell clustering algorithm, still 
need to be corrected for in the (larger) jet context. An attempt is under study in ATLAS to correct for 
these effects jet by jet, possibly using measurable jet shapes and detailed cluster information, but not 
all sensitivities and useful variables are yet fully understood to achieve a particle level calibration this 
way. 

Important validation signals for jet calibration at the LHC, as at the Tevatron, are prompt photons 
and hadronic W decays. For systems which have a photon balancing one or more jet(s), corrections 
can be extracted from the pt balance. Uncertainties in this procedure mostly arise from initial and 
final state radiation, and the underlying event activity, all of which may limit the applicability of these 
corrections in different collision topologies. The W mass inW ^ qq can be used for the same purpose. 
Again, care is required when applying the corrections derived from the mass constraint to other physics 
topologies. At the LHC, most of the hadronically decaying Ws in the recorded data will be reconstructed 
in the context of ti events, which are very busy final states with potentially large amounts of energy 
"accidentally" scattered into the jet(s). Also, the W is color-disconnected from the rest of the event, 
which changes the general px fiow around its decay jets. 

5.3.4 Use of Tracks in Jets 

Reconstructed tracks from the inner tracking detectors in both ATLAS and CMS, in principle, can 
be used to better calibrate and characterize a given reconstructed jet. Classical energy fiow based 
reconstruction techniques combine a track with the calorimeter response (typically one cluster) and 
make use of the feature that the track provides much better energy resolution than the calorimeter 
for lower energy particles. The application of these techniques promises considerable improvement 
in the kinematic reconstruction of a single isolated particle like an electron or r; the application to 
jets is still under study. In particular, the "subtraction" of the charged response from the calorimeter 

^^The cell location is typically indicated by calorimeter module and sampling identifiers, together with a cell index, 
rather than absolute coordinates, for example. 
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Figure 46: A simulated QCD 2-^2 event with two hard cone jets (1 and 2) with energies around 2 TeV 
each and two softer jets with energies around 20 GeV (3 and 4) in ATLAS. Same colored bins belong 
to the same jet. The change of the jet shape clearly depends on the calorimeter signal definition. 

jet signal is much more challenging due to the already mentioned overlap of showers and generally 
high tracker occupancy in the jet case. There are indications, however, that the fraction ft^k of the 
jet energy carried by (reconstructed) charged tracks into a jet is already a useful variable to refine 
the jet calibration, even without individual track/cluster matching. If ftrk is large for a given jet, the 
corresponding calorimeter response has a larger contribution from hadronic showers and may deserve 
additional calibration corrections to improve the jet energy scale, especially with a precision requirement 
at the level of 1%. 

5.3.5 Jet Algorithms 

Both ATLAS and CMS use the iterative seeded cone jet finder with Rcone = 0.4 (ATLAS) and Rcone = 
0.7 (ATLAS and CMS), and the kx jet finder with D = 0.4 and D = 0.6 (ATLAS), and D = 1 in 
CMS. Other jet algorithms are generally available and under study in particular with respect to certain 
physics needs (see the discussion in Section E]). The software is implemented such that exactly the same 
code runs on all possible input objects (partons and hadrons from Monte Carlo generators, detector 
signals like clusters, towers, reconstructed tracks, etc.), as long as those represent a full four-momentum 
measure. 

5.4 Jet Signal Characteristics at the LHC 

The jet topology as it unfolds in the calorimeters is basically driven by the combination of calorimeter 
absorption characteristics, the chosen signal definition and jet algorithm choice. This can be seen in the 
Pythia event in Fig. HHl The tower picture of this particular final state shows rather large jets in general, 
especially when compared to the hadron jet. This is partly a consequence of the re-summation discussed 
above, but also due to the fact that towers especially in the endcap and forward regions are filled with 
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Figure 47: The number of constituents for hadron, cluster, and tower kj- jets {D = 0.6) as a function 
of the jet pt for simulated central events in ATLAS, in various regions of rapidity y (left). The right 
figure shows the number of clusters versus the number of particles in matched cluster and hadron jets 
in the central and endcap region, from the same simulated data. 



signals more generated by the lateral shower development (see jet 4 in Fig. HHD than the hadron energy 
flow. Topological clusters, on the other hand, collect spatially distributed cell signals, as can again 
be seen very well in jet 4. In the cluster picture this jet is very coUimated in rapidity, and consists 
of only two clusters. This reflects the rather coarse cell readout granularity of the ATLAS forward 
calorimeters, which suppresses cluster splitting due to lack of resolvable (lateral) signal structures. In 
the highly granular central and endcap regions the cluster jet shapes match very well the hadron jet 
shape, as can be seen from jets 1 to 3 in this figure. 

The particular choice of the calorimeter signal definition to be used in jet reconstruction affects the 
ability to reconstruct some of the jet kinematics, such as the jet mass. For example, for cone jets with 
R = 0.7 made from calorimeter towers with Arj x A0 = 0.1 x 0.1, the number of constituents N^. is 
given by 

~ cone ^ 

Ar/ X A0 = 0.1 X 0.1 ^ 

independent of the jet direction and energy. Calorimeter signals such as the topological clusters, which 
are more sensitive to the hadron level composition of the jet, typically generate jets with at least 
indirectly related to the number of incoming particles (see Fig. HTj). Naturally, the incoming particle 
energy flow is convoluted with possibly overlapping shower developments and distributed onto a finite 
readout granularity. The relation between cell sizes and electromagnetic and hadronic shower sizes puts 
limitations on the reconstruction of the original incoming particles, and defines the image of the jet in 
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Figure 48: The change of the jet mass for a constituent selection based on their transverse momentum 
p^nst ^p^nst ^ p™«n)^ for kx jets from particles, towers, and clusters {D = 0.6), as a function of the jet 
mass calculated from all constituents. 



the calorimeter. Fig. HT] indicates that in the central region of ATLAS the cell sizes, even though small 
iny/rj and 0, are comparably big with respect to shower sizes, while in the endcap region the cell sizes 
get sufficiently small, thus improving the structural resolution power of the cluster jets. 

One of the more interesting jet variables is its mass. The ability to reconstruct this mass within 
reasonable precision has considerable impact on the general reconstruction of heavily boosted systems 
like the top quark, where all final state decay products may end up within a typical jet cone. In 
this case the jet mass is the only reconstructable observable giving any indication of the nature of 
the decaying system. The mass reconstruction is still nearly perfect at the hadronization level, at 
least if all final state particles belonging to a given parton are efficiently collected by the chosen jet 
algorithm. Reconstructing the mass from calorimeter signals is much more challenging in that not 
only the showering and the resulting signal overlap in the calorimeters smear the mass measurement 
considerably, but also the fact that the solenoidal fields in front of the calorimeters (~ 2 T in ATLAS, 
~ 4 T in CMS) bend charged particles with pt < (400 — 800) MeV/c away from the detectors, i.e. 
outside the jet cone. In addition, the unavoidable amount of dead material typically introduced by the 
inner detectors and their services as well as calorimeter support structures and cryostat walls (ATLAS 
only), can significantly reduce the low energy photon signal. This effect has been addressed in a brief 
study for ATLAS, where the jet mass variation 6m/m introduced by excluding jet constituents below 
certain thresholds, starting from 100 MeV up to 2 GeV, is calculated from QCD di-jet simulations for 
particles, and the corresponding simulated calorimeter tower and cluster signals with the same cuts 
applied. Fig. HH] summarizes the results of this study, which indicate that the cluster signals follow the 
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Figure 49: Estimates from simulations for the ATLAS detector for the energy fraction outside of a cone 
with r = 0.3 around the jet axis, as a function of the jet px for (left) and cone (right) jets, in two 
different regions of rapidity y, for jets in QCD 2^2 processes. 



effect at particle level quite well, if compared to tower jets. 

Jet reconstruction is therefore in general affected by magnetic field effects, the upstream energy 
losses from dead materials, showering, leakage, calorimeter regions of low efficiency (cracks), and the 
underlying and pile-up event activity. Any particular sensitivities of a given jet algorithm to any of these 
effects can be enhanced or suppressed by a specific calorimeter signal choice, e.g. towers or clusters. 
The jet energy scale calibrations and corrections applied to recover the corresponding energy losses 
are as much as possible factorized for better control of systematic uncertainties. The magnitude of the 
contribution of any of these effects to the jet energy scale and jet shape reconstruction varies, depending 
on the combination of the calorimeter signal choice and the chosen jet algorithm. 

One example for an experimentally accessible observable reflecting some of these sensitivities is the 
jet energy density measure ip{r), which is the fraction of energy contained in a cone of radius r within 
a jet. Fig. H9] shows that towers and clusters in the ATLAS calorimeter are expected to produce very 
similar densities for high px jets, but show some differences for jets with pt ^ 100 GeV/c. Hadron jets 
at lower pt are significantly broader, i.e., have a larger fraction of the jet energy away from the center 
of the cone, than either tower or cluster jets. In general the shape of ipipr) in different rapidity regions 
changes, depending on the jet algorithm choice, in a similar way for all three kinds of jets. 



6 SpartyJet 

As we have emphasized throughout this review, jets, unlike photons or electrons, are complex objects 
and the resultant reconstructed 4- vectors may depend on the details of the jet clustering algorithm. Each 
algorithm has its own strengths and weaknesses and a more robust understanding of the physics of an 
event can be obtained by examining the result of reconstruction with more than one jet algorithm |78j . 
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Figure 50: The inclusive jet cross section for the LHC with a PT,min value for the hard scattering of 
approximately 2 TeV/c, using several different jet algorithms with a distance scale {D = Rcone) of 0.7. 
The first bin has been suppressed. 

Past experience at the Tevatron has been that only one algorithm is typically used for any physics 
process, at least partially because of the limitations of the analysis machinery. In order to foster a 
more flexible experimental philosophy, a collection of jet routines was created (SpartyJet) [79] that can 
allow the reconstruction of the jets from either data or Monte Carlo using multiple algorithms/ varied 
parameters. The routine makes use of the FastKt package for the jet algorithm and the seedless 
cone algorithm SISCone in addition to other algorithms used by experiments at the Tevatron and LHC. 
The program can run either in the ROOT [5U] format, for example inside an ntuple, or it can run 
in stand-alone fashion on a collection of 4-vectors. As an example, in Fig. [501 is shown the results 
of running SpartyJet on a sample of Monte Carlo events generated for the ATLAS experiment. The 
sample consists of dijet events with a PT,min of approximately 2 TeV/c and the inputs to SpartyJet are 
the topological calorimeter clusters discussed in the previous section. The cone algorithms are run with 
a cone radius of 0.7 and a split/merge criterion of 0.75. The kx algorithms are run with a D parameter 
of 0.7 as well. The dijet character of the events can be seen by the clear peak at approximately 2 TeV/c, 
and the impact of hard gluon radiation off the initial and final states can be observed in the sizeable 
lower tail at lower transverse momenta. The jets found by the 2nd pass algorithm for the Midpoint 
algorithm can also be observed at the lower transverse momentum values. Note that on this plot the 
cone and algorithms with similar scale parameters give similar results for the cross section. Any 
differences need more detailed comparisons to become apparent. 

Another interesting variable to plot using SpartyJet is the distribution of jet masses. As discussed in 
Section [3731 the mass of typical QCD (gluon or light quark) jets is generated primarily by perturbative 
gluon emission. This is to be contrasted with a jet arising from a heavy quark (such as a top quark) that 
also has an intrinsic mass from the heavy quark. The distribution of jet masses from a typical QCD 
event sample (restricting the transverse momentum range to 1.8 — 2.2 TeV/c) is shown in Fig. [511 There 
is a Sudakov suppression of low jet masses, which can arise only if there is little or no gluon radiation 
from the short-distance final state partons. At jet masses above the peak (here at approximately 125 
GeV/c^) the jet mass distribution falls slowly, roughly between 1/m and 1/m^, with the average jet 
mass at a value above the peak (approximately 150 GeV/c^ in this sample). There is also a suppression 
of jets with large masses due to the tendency of the jet algorithms to split jets in which the energy is 
widely dispersed. Note that for these very high pt jet events that a jet mass of the order of 175 GeV/c^ 
(purely from gluon emission) is not uncommon, so this is a caveat for the naive use of the mass of a jet 
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Figure 51: The jet mass distributions for an inclusive jet sample generated for the LHC with a PT,min 
value for the hard scattering of approximately 2 TeV/c, using several different jet algorithms with a 
distance scale {D = Rcone) of 0.7. The first bin has been suppressed. 
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Figure 52: The y-scale distributions for the jets from the 1.8 — 2.2 TeV/c jet sample to be split into 
two subjets. Several different jet algorithms with a distance scale {D = Rcone) of 0.7 are used. 



to search for highly boosted top quarks. 

One can gain further information about the origin of a jet by examining the y-scale (using the 
kx algorithm) at which the jet can be split into two subjets. This ability has been implemented into 
SpartyJet by the use of the ^/-splitter routine [81]. This scale will tend to be larger for highly boosted 
massive objects (like a high pr W or top quark) than for QCD jets at the same transverse momentum. 
For example, for a boosted W, the y scale for resolving the VT-jet into two subjets should be on the 
order of my//pT,jeu while for jet structures produced by QCD radiation, the scales should be much 
smaller. The ?/-scale distribution for the jets from the 1.8 — 2.2 TeV/c jet sample to be split into two 
subjets is shown in Fig. [52l Low scales dominate as expected for a QCD jet sample. 

The average jet mass is plotted in Fig. [53] versus the transverse momentum of the jet for several 
jet algorithms for an inclusive jet Monte Carlo sample with transverse momenta from 100 GeV/c 

j/2 = niiii(p^ 2) ' where p^^andp^^ ^-re the transverse momentum values of the two subjets and ^2 is 

the distance scale at which the jet is divided into two subjets. 
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to 1 TeV/c. The average mass increases roughly hnearly with the jet transverse momentum. The 
reconstructed Monte Carlo jet mass values are in reasonable agreement with the NLO perturbative 
predictions discussed in Section 13.31 Thus, the jet mass, just as the jet shape, can be reasonably 
described by a NLO partonic calculation. The average Monte Carlo jet mass at high pt tends to approach 
the NLO prediction carried out with the use of an Rgep value of 1 (compared to the canonical value 
of 1.3), perhaps due to the impact of the very narrow jet profiles at these high transverse momenta. 
All of the jet algorithms result in a similar average jet mass, although JetClu tends to give larger 
results, due to the effects of ratcheting. As discussed in Section 13.31 the average jet mass is expected 
to scale as R ■ pt, where R is the size parameter for the jet. It is interesting to note that variations in 
theoretical (Rsep) and in experimental jet algorithms (ratcheting) have little impact on the magnitude 
of the inclusive jet cross section, but do have a noticeable effect on the jet mass. 
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Figure 53: The average jet mass is plotted versus the transverse momentum of the jet using several 
different jet algorithms with a distance scale [D = Rcone) of 0.7. 

A lego plot of a single event from the high px jet sample is shown in Fig. [5^ a). Again, there is a 
clear dijet structure in which the jets have very coUimated cores but the presence of such a large scale 
in the event results in there being a several extra jets of quite sizeable transverse momentum in their 
own right. This becomes more apparent when we change the transverse momentum scale in the lego 
plot as shown Fig. IMT b). Note that the jet colored in white is a 2nd pass jet, having been missed by 
the Midpoint algorithm in the first pass due to the effects discussed earlier. 

SpartyJet is currently in use by ATLAS, CDF and CMS. 



7 Conclusions 

Jets are present in nearly every final state measured at the Tevatron. This will be true as well at 
the LHC, with the environment for jet measurements being even more challenging. Thus, it is crucial 
both to improve our understanding of the subtleties of jet reconstruction as well as to continue the 
development of new tools. In this article, we have tried to remove some of the mystique regarding the 



63 



(a) (b) 

Figure 54: (a) A lego plot for an event from the inclusive jet sample generated for the LHC with a PT,min 
value for the hard scattering of approximately 2 TeV/c. (b) A lego plot for the same event; however, 
the pt scale has been cut off at 100 GeV/c to give a clear view of low pt jets. 



measurement of jets at hadron-hadron colliders by pointing out the connections between the theoretical 
predictions and the experimental measurements, and the similarities and differences between the cone 
and algorithms. In the process we have reviewed the history of jets at the Tevatron, including both 
the experimental and theoretical successes, and the surprises and mis-steps. Looking to the future 
we have outlined the issues expected to be important at the LHC. Specifically, we have discussed jet 
reconstruction and jet energy calibration experience at the Tevatron experiments and the on-going work 
at LHC experiments. LHC experiments benefit from the experience at the Tevatron (and also other 
experiments such as the HERA experiments), and, due to excellent capabilities of the ATLAS and CMS 
calorimeters, more advanced schemes are being explored such as the use of topological calorimeter cell 
clusters as inputs to jet clustering. An essential payoff of measurements performed at the Tevatron 
is the ability to accurately tune Monte Carlo event simulations to improve the modeling of jets. As 
we have discussed it is important that these experimental measurements be presented at the hadron 
level to facilitate the model tuning process. These measurements must also be repeated in the new 
environment of the LHC experiments. 

In this context of past experiences and future expectations we have made several recommendations 
that we feel will play an essential role in the successful analysis of the data from the LHC. These include: 

• the use of a variety of jet algorithms for physics analyses with continuous cross-checking of results 

• the use of 4-vector kinematics, including evaluation of the jet mass, to characterize a jet 

• the use of seedless algorithms (or correction back to seedless) in cone-based jet clustering 

• the correction (where possible) of jets back to the hadron level in experimental analyses 

In addition, we have presented a framework (SpartyJet) that facilitates the use of multiple jet algorithms 
in both experimental and theoretical studies. 

We close by applauding the 20 years of highly successful jet physics at the Tevatron and looking 
forward to an equally exciting application of jets to physics beyond the Standard Model at the LHC. 
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